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Acid a-Amylase 
Field of the Invention 

The present invention is directed to purified 
genetic and protein sequences encoding the acid a- 
amylase protein of B. acidocaldarius , recombinant 
constructs and hosts transformed therewith that are 
capable of expressing such sequences, and the use of 
such genetic sequences for the expression of acidic 
aamylase, and the use of the recombinant ly-produced 
acid a-amylase protein for industrial and agricultural 
applications such as, for example, starch 
liquefaction, and silage preparation- 

BrAgf Pescription of the fearArn -ound Art 

Lactobacillus and Bacillus strains offer many 
potential advantages in the production of cloned gene 
products, as compared with Escherichia coli. First, 
they are non-pathogenic and do not synthesize 
endotoxins. Second, many of the gene products are 
secreted into the growth mediiim, in contrast to E. 
coli, which retains most of the proteins due to the 
presence of an outer membrane. Third, they have been 
widely used for production of enzymes in large-scale 
fermentation processes. While Lactobacillus is not an 
industrial enzyme producer, it is highly used in the 
food and animal feed processing industry. 

One important highly desirable exoenzyme secreted 
in large^ amounts by Bacilli is a-amylase. a-Amyiase 
is used in a variety of industrial applications, for 
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example, starch liquefaction. However, the industrial 

use of this enzyme is limited in many cases by its 

sensitivity to acidic conditions. That is, most forms 

of the enzyme are not enzymatically stable at acidic ^ 

pH's. 

At least four acidophilic and acid-stable a- 
amylase isozymes have been reported in different 
strains of B. acidocaldarius (Kanno, M. , Agric. Biol, 
Cbem. 50:23-31 (1986); Buonocore, V. et al., J. 
BactBriol^ 128 z 515 (1976); Boyer, E.W. . et al., 
Starcb/StarkB 31:166 (1979); and Uchino, F. , A^ric. 
Biol. G&em. 46:7 (1982). An "acidic" a-amylase is 
characterized as being relatively stable to heat under 
acidic conditions. For example, the acid a-amylase's 
produced by Bacillus acidocaldarius strain A2 retains 
more than 79-90% of its activity after 30 min of 
incubation at pH 2.0 (70*»C) and at pH 4.5 (90<»C) in 
the absence of substrate (Kanno, M. , Agric. Biol. 
Chem. 50:23-31 (1986)* 

a-Amyla^e has also been isolated from B. 
amylollgaefaclens (Ingle et al.^. Adv. Appl. Microbiol. 
241257-278 (1987) ) • This enzyme has an M^-value of 
about 50,000 daltons and has been sequenced (Takkinen 
Bt al.r J. Biol. Cbem 258: 1007-10i3 (1983); Chvmg et 
al., Biocheia. J. 185:387-395 (1980)). The expression 
of this enzyme in B. subtilis has been reported 
("Expression and Regulation of the Bacillus 
amyloligaefaciGns a-amylase gene in B. subtilis 
P.Kallio, Ph«D. dissertation. University of Helsinki, 
1987). 

Acidic amylase sequences are especially desirable 
for industrial purposes, where pH values may fall 
(meaning- the medium becomes more acidic) during a • 
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desired reaction due to the natural catalytic action 
. of the enzymes therein. For example, a process strain 
that possesses an acid a-amylase activity would be 
valuable for the Lactobacillus strains as lactic acid 
bacteria are widely used for preservation of many 
starch containing raw materials (e.g., cereal grains, 
edible roots and crop residues) . 

Although such approaches and applications have 
been highly desired in the art (Applied Environment. 
Microbiol. 55:2130-2137 (1989)), they have not been 
possible to obtain. The main obstacle was the lack of 
an a-amylase enzyme that is not inactivated by the low 
growth pH of Lactobacillus. So far, no genes of 
bacterial origin, coding for an acid a-amylase enzyme, 
have been characterized and the isolation of large 
guanti1:ies of such acid a-amylases from natural 
sources is relatively expensive. This has precluded 
the wide-spread adoption of such enzymes in industry. 
Accordingly^ a need exists for a cost-efficient 
recombinant source of a highly stable acid a-amylase. 

SUMMARY O F THE INVEl^ fOKf 

Recognizing the importance to the feed industry 
of a host that would be capable of providing an acid- 
stable c-amylase activity during the food preseirvation 
period^ and cognizant of the lack of hosts capable of 
providing such enzymatic activity in the currently 
utilized processes, the inventors have investigated 
the properties of the acid a^-amylase^ from B. 
acidocaldarius . The inventors desired to generate a 
new Lactobacillus host, such host having a novel 
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property thai: would be advantageous in the food and 
feed industry* 

These efforts have culminated in the development 
of novel recombinant forms of acid a-stmylase, hosts 
expressing such acid a-zunylase, and highly improved 
processes for the preservation of food utilizing such 
hosts . 

According to the invention, there are first 
provided a polynucleotide, such polynucleotide 
providing the coding sequence of an acid ce-amylase. 

According to the invention, there are also 
provided recombinant vectors, such vectors providing 
a host expressible form of acid a-amylase. 

According to the dlnvention, there are fxirther 
provided host cells transformed with polynucleotides 
and/or vectors that are capable of expressing 
recombinauit forms of acid a-amylase. 

According to the invention, there are further 
provided methods for producing the genetically 
engineered or recombinant protein acid a-amylase of 
the invention using such hosts. 

According to the invention, there are further 
provided methods for the utilization of the acid a- 
amylase enzyme of the invention. 

DESCRTPTION OF THE FIGURES 

Figure 1. SDS-PAGE of the (NH^)2S0^ precipitated 
proteins. Lanes 1 and 2 represent Coomassie Brilliant 
Blue R stedLned gel and zymogram, respectively. The 
arrows show the positions of the bands corresponding 
amylase • activity • Molecular weight markers 

(Pharmacia) are indicated on the left. 
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Figure 2. Effect of pH on the activity of B. 
acidocaldarius amylase (s) . Enzyme activity in 100 /xl 
of the 0.5 M sodiiim acetate, pH 5.0 eluted proteins of 
B. acidocaldarius cultures was measured using Phadebas* 
amylase test at 60 in 0.1 M citric acid, 0.2 M 
Na^HPO^. 

Figure 3. The nucleotide sequence of the B- 
acidocaldarius acid a-amylase gene. The putative 
-35 and -10 regions (bases 126 and 148, respectively) 
of the amylase promoter aure underlined. The potential 
signal sequence cleavage site (base 273) is indicated 
by an arrow. The N-terminal sequence (base 1594) of 
the 90 kd protein is underlined with a dotted line. 
The two alternative N-termini of the signal peptide 
indicated by cui asterisk. 

DESCRIPTT QN OF THE PREFERRED EMBQDTMENTS 
I. Definitions 

In the description that follows, a niuaber of 
terms used in recombinant DNA (rDNA) technology are 
extensively utilized. In order to provide a clear and 
consistent understsuiding of the specification and 
claims, including the scope to be given such terms, 
the following definitions are provided. 

Process strain, a "process strain" is a 
bacterial strain that causes or contributes to a 
desired change in a biotechnical process. - 

Biotechnical process, a biotechnical process 
is a process that depends upon the production of 
bacteria^ or biomass for the achievement of a desired 
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chemical t:ransfon&at:ioh of the process mediuia, such 
bacteria or biomass providing a component or 
environment necessary for such conversion. 

Fodder . "Fodder" is anything fed to 
domesticated animals, and especially, cocurse food for 
cattle^ horses or sheep. 

Forage. "Forage" is food for domesticated 
animals that is taken by browsing or grazing. 

Silage , "Silage" is- fodder converted into 
succulent feed for livestock through processes of 
anaerobic acid fermentation (as occurs in a silo). 

Gene , A DNA sequence containing a template for 
a SNA polymerase. The RNA transcribed from a gene may 
or may not code for a protein, RNA that codes for a' 
protein is termed messenger RNA (mRNA) . 

A "complementary DNA" or "cDNA" gene includes 
recombinaoit genes synthesized by reverse transcription 
of mRNA and from which intervening sequences 
(introns) have been removed. 

Genetic sequence. As used herein, 'the term 
"genetic sequences** is intended to refer to a nucleic 
acid molecule (prefereO^ly DHA) . Accordingly, "acid a- 
amylase genetic sequences" are DNA or BMA sequences 
that possess a nucleotide sequence that encodes the 
acid a-amylase protein. 

Promoter, The term **promoter" as used herein 
refers to a module or group of modules which, at a 
minimum, provides a binding site or initiation site 
for RNA polymerase action that is sufficient to 
initiate transcription of the operably linked in a 
desired host, A promoter is generally composed of 
multiple operably linked genetic elements termed 
herein "modules. *' 
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Promote r Module. The term "module" as in 
"promoter module" refers to a genetic transcriptional 
regulatory element which provides some measure of 
control oyer the transcription of operably linked 
coding sequences or other operably linked modules. 

Each module in a promoter can convey a specific 
piece of regulatory information to the host cell's 
transcriptional machinery. At least one module in a 
promoter functions to position the start site for RNA 
synthesis. other promoter modules regulate the 
frequency of transcriptional initiation. Typically, 
modules which regulate the frequency of 
transcriptional initiation are located upstream of 
(i.e., 5* to) the transcriptional start site, although 
such modules may also be found downstream of (i.e., 3\ 
to) the start site. 

The term "target module," as used herein, refers 
to a transcriptional regulatory element which confers 
the ability to respond to enhamcer gene activity 
(i^e., such as the protein or peptide encoded by an 
enhancer gene) on a promoter which otherwise would not 
respond, or would respond less efficiently, to such 
enhancer gene activity. 

The term "initiation module" refers to a promoter 
module which is required to initiate transcription of 
operably linked genes with RNA polymerase. In 
prokaryotic promoters, initiation modules are usually 
located at about -lo and -35 nucleotides from the 
Btart site of transcription. 

By "hybrid promoter" is meant a promoter in which 
an initiation module is operably linked to a 
heterologous target module. A tsurget module which is 
heterologous to . an initiation module is a target 
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module which is not found naturally operably linked to 
this initiation module in the host cell. 

Operable 1 inkaae > An "operable linkage" is a 
linkage in which a sequence is connected to another 
sequence (or sequences) in such a way as to be capable 
of altering the functioning of the sequence (or 
sequences) - For example, a protein encoding sequence 
which is operably linked to the hybrid promoter of the 
invention places expression of the protein encoding 
sequence under the influence or control of the regula- 
tory sequence • Two DNA sequences (such as a protein 
encoding sequence and a promoter region sequence 
linked to the 5' end of the encoding secpience) are 
said to be operably linked if induction of promoter 
function results in the transcription of the protein 
encoding sequence mENA and if the nattire of the 
linkage between the two DMA sequences does not 
(1) result in the introduction of a frame-shift 
mutation, (2) interfere with the ability of the 
expression regulatory sequences to direct the esqpres- 
sion of the mRNA or protein. Thus, a promoter region 
would be operably linked to a DNA sequence if the 
promoter were cap2J)le of effecting transcription of 
that DK& sequence. 

Cloning vector , A "cloning vector'* is a plasmid 
or phage DNA or other DNA sequence which is able to 
replicate autonomously in a host cell, and which is 
characterized by one or a small number of endonuclease 
recognition sites at which such DNA secpxences may be 
cut in a determinetble fashion without loss of an 
essential biological function of the vector, and into 
which DNA may be spliced in order to bring about its 
replication and cloning. The cloning vector may 
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further contain a marker suitable for use in the 
identification of cells transformed with the cloning 
vector. Markers, for example, are erythromycin and 
kanamycin resistance. The term "vehicle" is sometimes 
used for ••vector." 

Expression vector. An "expression vector" is a 
vector similar to a cloning vector but is capable of 
expressing a structural gene which has been cloned 
into the expression vector; after transformation of 
the expression vector into a host. In an expression 
vector, the cloned structural gene (any coding 
sequence of interest) is placed under the control of 
(i.e., operably linked to) certain control sequences 
which allow such gene to be expressed in a specific 
host. In the expression vector of the invention, a 
desired structural gene is operably linked to the 
hybrid promoter of the invention. Expression control 
sequences will vary, and may additionally contain 
transcriptional elements such as teinaination sequences 
and/or translational elements such as initiation and 
termination sites. 

The expression vectors of the invention may 
further provide, in an expression cassette other than 
the one . providing the hybrid promoters of the 
invention, sequences encoding a desired enhancer gene. 
In a preferred embodiment, such enhancer gene would be 
the enhancer gene which encodes the protein which 
regulates the target module of the hybrid promoter. 

Functional Derivative, a "functional derivative" 
of a molecule, such as a nucleic acid or protein, is 
a molecule which has been derived from a native 
molecule;, and which possesses a biological activity 
(either 'fxinctional or structural) that is substan- 
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tially similar to a biological activity of the native 

molecule, but not identical to the native molecule. By ^ 

a functional derivative of a nucleic acid sequence 

(herein the "second sequence") that encodes a protein ^ 

(herein a "first" sequence) is meant (1) a nucleic 

acid sequence that does not possess the exact same 

nucleotide sec[uence as the first sequence but which 

encodes the same amino acid secpience; and, a nucleic 

acid sequence that does not encode the same amino acid 

sequence but »hich encodes a protein with biochemical 

properties (such as, for example, enzymatic stability 

in acidic conditions) equivalent to (i.e., the same as 

or slighly different from) the protein encoded by the 

first nucleic acid sequence* 

A functional derivative of a protein is a protein 
that retains a desired function of the "parent" 
protein from which it is derived. A ftinctional 
derivative of a protein may or may not contain post-* 
treuislational modifications, such as covalently linked 
C2urbohydrate, depending on the necessity of such 
modifica-kions for the performance of a specific 
function* The term "functional derivative" is 
intended to include the '^fragments,** "variants," or 
n<^emical derivatives" of a molecule. 

As used herein, a molecule is said to be a 
"chemical derivative" of another molecule when it 
contains additional chemical moieties not normally a 
part of the molecule. Such moieties may improve the 
molecule's solubility, absorption, biological half 
life, etc. The moieties may alternatively decrease 
the toxicity of the molecule, eliminate or attenuate 
any undesirable side effect of the molecule, etc. 
Moieties capable of mediating such effects are • 
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disclosed in Remington's Pharmaceutical Sciences 
(1980) • Procedures for coupling such moieties to a 
molecule are well knovm in the art. 

fragment. A "fragment" of a molecule such as a 
nucleic acid or protein is meant to refer to a mole- 
cule which contains a portion of the complete sequence 
of the native molecule. 

Variant > A "variant" of a molecule such as a 
nucleic acid or protein is meant to refer to a mole- 
cule substantially similar in structure and biological 
activity to either the entire molecule^ or to a 
fragment thereof, but not identical to such molecule 
or fragment thereof • A variant is not necessarily 
derived from the native molecule itself. A gene allele 
is an example of what is meant by a variant nucleic 
acid sequence and an enzymatic isozyme is an example 
of what is meant by a variant enzyme sequence, Thus^ 
provided that two molecules possess a similar 
biological activity or function that characterizes 
molecules as a •family', they are considered variants 
as that term is used herein, even if the composition or 
secondary, tertiary, or quaternary structure of one of 
the molecules is not identical to that found in the 
other, or if the sequence of nucleic acid (or amino 
acid residues) is not identical, or if the synthesis 
of one of the variants did not derive from the other. 
Thus the invention intends to include all variants 
(genomic alleles and enzymatic isozymes) of the acid 
a-amylase of the invention. 
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II- Genetic Engineering of Acid oe-amylase 

Prior to the invention, the cloning of the acid 
amylase gene was not thought to be possible due to the 
lack of cross-reacting antibodies, the lack of 
sufficient protein to obtain such antibodies, the lack 
of a reliable probe and the lack of protein sequence 
data from which to derive such probe. 

Xhe available antisera or gene probes of other 
known bacterial amylases (for example, those of B. 
amylolxquBfaclens strain A2 as described by Kanno, M. 
Agric. Biol. Cbem. 50:23-31 (1986), B. llcheniformis 
cU5 described by Siato, K., Arch. Biodiem. Bipbys, 
255:290 (1973) and B. subtilis) does not provide any 
crossreactivity or positive hybridization when tested 
with material derived from B« acLdocsLldarius . When the 
cuuylase preparation from B« amylollquBfacxens was 
analyzed by kymography, it revealed an amylase band of 
molecular weight 150,000-200,000 daltons. This is 
much higher than previously reported for other 
bact^ial amylases. Due to the very low initial yield 
and large size of the acid amylase, it was not 
possible to determine the K-terminal sequence of this 
protein by standard protein chemistry methods in order 
to obtain an oligonucleotide probe. 

Thus it was necessaury to identify clones to the 
B. acldalcaritxs acid as^lase without having available 

(1) antibodies that cross-reacted with the protein or 

(2) Protein or DNA sequence information upon \^ch to 
design a oligonucleotide probe. 

To solve this problem, a new method for the 
identification of acid amylase protein and clones was 
developed • A 90,000 dalton peptide was found in 
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protein extracts of B. acxdocaldarius , that had a very 
low specific amylase activity. An assumption was made 
that this peptide would be sufficiently related to the 
large acid amylase protein to allow selection of 
clones to the acid amylase protein, using probes 
designed from the sequence of the smaller peptide. 
This was not a risk-free assumption as the low level 
"amylase" activity displayed by the 90,000 dalton 
peptide could easily have been a side activity of a 
different type of enzyme, and not related to the acid 
amylase enzyme of the invention per se. Thus^ there 
was no way of confirming that the assumption upon 
which the cloning of the enzyme of the invention was 
based was correct until the entire cloning and 
sequencing was complete. 

The process for genetically engineering the acid 
q-amylase sequences of the invention is facilitated 
through the cloning of genetic sequences which are 
capable of providing specific protein encoding 
sequences. Genetic sequences which are capable of 
providing protein encoding sequences may be derived 
from genomic DNA, synthetic DNA, cloned DNA and 
combinations thereof. The preferred species source of 
the acid a-anylase of the invention is J3. 
aciclocaldarius , although any source of an acid a- 
amylase may be used* 

Genetic (protein encoding) genomic DNA will not 
contain introns in prokaryotes, although it may 
contain spacers between transcriptional units. As 
outlined below, such genomic DNA may be obtained in 
association with the 5» promoter region and/or the 3« 
trsuiscriptional termination region if desired. 
Further/ such genomic DNA may be obtained in 
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associat:ion vit:h tixe genetic seq[uences which encode a 
5' non-txanslated region of the desired mRNA and/or 
with the genetic sequences which encode the 3 ■ non- 
translated region. To the extent that a host cell can 
recognize the transcriptional and/or translational 
regulatory signals associated with the expression of 
the mRNA and protein, and to the extent that such 
signals do not impede the expression of the 
recombinant acidic aamylase of the invention, then the 
5' and/or 3" non-transcribed regions of the native 
gene, and/or, the 5« and/or 3" non-tremslated regions 
of the mRNA, may be retained and employed for 
transcriptional and translational regulation. 

The coding sequences for the acid a-aunylase of 
the invention is not rearranged by the native 
bacterial host prior to expression in such host. To 
obtain coding sec[uences for proteins whose genes are 
not rearranged prior to expression, genomic DNA can be 
extracted cind purified from any cell of any host which 
aaxrxBs the coding sequence, whether or not the cell 
e3cpresses the protein. Such extraction of genomic DNA 
can be performed by means well Icnown in the art (for 
exanxple, see Guide to Molecular Cloning Techniques, 
S.L. Berger et al., eds.. Academic Press (1987)). 

Alternatively, nucleic acid sequences that encode 
a desired protein can also be obtained by DNA 
transcribed from mRNA specific for that protein. mRNA 
can be isolated from any cell which produces or 
expresses the protein of interest and used to produce 
cDNA by mezms well known in the art (for example, see 
Guide to Molecular Cloning Techniques, S.L. Berger et 
al., eds.. Academic Press (1987)). Preferably, the 
mRNA preparation used will be enriched in mRNA coding 
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for the desired protein, either naturally, by 
isolation from a cells which are producing large 
amounts of the protein, or in vitro, by techniques 
commonly used to enrich mRNA preparations for specific 
sequences, such as for example, gel electrophoresis, 
sucrose gradient centrifugation. 

To prepare DMA for cloning into a cloning vector 
or an expression vector, a suitable DNA preparation 
(either genomic DNA or cDNA) is randomly sheared or 
enzymatically cleaved, respectively. Such DMA can then 
be ligated into appropriate vectors to form a 
recombinant gene (either genomic or cDMA) library. 

A DNA sequence encoding a protein of interest or 
its functional derivatives may be inserted into a 
cloning vector or an expression vector in accordance 
with conventional techniques, including blunt-ending 
or staggered-ending termini for ligation, restriction 
enzyme digestion to provide appropriate termini, 
filling in of cohesive ends as appropriate, alkaline 
phosphatase treatment to avoid xmdesirable joining, 
and ligation with appropriate ligases. Techniques for 
such manipulations are disclosed by Mahiatis, T., et 
al.. Molecular Cloning (A Laboratory Manual), Cold 
Spring Harbor Laboratory, second edition, 1989), and 
are well known in the art. 

Libraries containing clones encoding a desired 
protein or a desired transcriptional regulatory 
element may be screened and a desired clone identified 
by any means which specifically selects for the DNA of 
interest. For example, if a clone to an acid o- 
asq^lase is desired, such a clone may be identified 
any meaxis used to identify acid a-amylase protein or 
mRNA, including, for example, a) by hybridization with 
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an approprialie nucleic acid probe (s) containing a 
seguence(s] specific for the DNA of this protein, or 
b) by hybridization-selected translational analysis in 
which native mRNA that hybridizes to the clone in 
c[uestion is translated in vitro and the translation 
products are further characterized, or, c). if the 
cloned genetic sequences are themselves capable of 
expressing mRNA, by immimoprecipitation of a 
translated protein product produced by the host 
containing the clone. 

Oligonucleotide probes specific for a desired 
protein can be used to identify a desired clone. Such 
probes can be designed from knowledge of the amino 
acid sequence of the desired protein. The sequence of 
amino acid residues in a peptide is designated through 
the use of the commonly employed three-letter or 
single-letter designations. A listing of these three- 
letter and one-letter designations may be found in 
textbooks such as JBiochemistry, Lehninger, A. , Worth 
Publishers, New York, NY (1970) • As used herein, when 
the amino acid sequence is listed horizontally, the 
amino terminus is intended to be on the left end and 
the caxhascy terminus is intended to be at the right 
end. 

Because the genetic code is degenerate, more than 
one codon may be used to encode a psurticular amino 
acid. The peptide fragments are analyzed to identify 
secpxences of amino acids that may be encoded by 
oligonucleotides having . the lowest degree of 
ciegeneracy. This is prefer2d)ly accomplished by 
identifying sequences that contain amino acids that 
are encoded by only a single codon. 
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Although occasionally an amino acid sequence may 
be encoded by only a single oligonucleotide sequence, 
frequently the amino acid sequence may be encoded by 
any of a set of similar oligonucleotides. Impor- 
tantly, whereas all of the members of this set contain 
oligonucleotide sequences that are capable of encoding 
the same peptide fragment and, thus, potentially 
contain the same oligonucleotide sequence as the gene 
which encodes the peptide fragment, only one member of 
the set contains the nucleotide sequence that is 
identical to the exon coding sequence of the gene. 
Because this member is present within the set, and is 
capable of hybridizing to DNA even in the presence of 
the other members of the set, it is possible to employ 
the unf ractionated set of oligonucleotides in the same 
manner in which one would employ a single oligo- 
nucleotide to clone the gene that encodes the peptide. 

Therefore, using the genetic code, one or more 
different oligonucleotides can be identified from the 
amino acid sequence, each of which would be capable of 
encoding thB desired protein. The probability that a 
particular oligonucleotide will, in fact, constitute 
the actual protein's encoding sequence can be 
estimated by considering abnormal base pairing 
relationships and the frequency with which a 
particulEo: codon is actually used (to encode a 
particular amino acid) in the host cell. Such "codon 
usage rules" are disclosed, for example, by Lathe, R. , 
et al., J. Wolec. BioX. 183:1-12 (1985). Using the 
"codon usage rules" of- Lathe, a single oligonucleotide 
sequence, or a set of oligonucleotide sequences, that 
contain ^ a theoretical "most probable" nucleotide 
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sequence capable of encoding tihe acid a-amylase of tihe 
invention is identified. 

In addition to the codon usage rules, 
oligonucleotide design can may utilize the use of 
deoxyinosine at ambiguous codon positions. This 
approach is particularly useful when the required DNAs 
sequence is derived from a poorly characterized 
organism like B. acidocaldarius (TakcOiaski et al., 
Proc. Natl. Acad. Scl. USA 82:1931-1935 (1985)). 

The suitable oligonucleotide, or set of 
oligonucleotides, that are capable of encoding a 
fragment o£ the desired gene (or which is 
compleraentea^ to such an oligonucleotide, or set of 
oligonucleotides) may be synthesized by means well 
known in the ax± (see, for example. Synthesis and 
Application of DNA and ENA, S.A. Narang, ed., 1987, 
Academic Press, San Diego, CA) and employed as a probe 
to identify and isolate the cloned gene by techniques 
known in the art. Technicpies of nucleic acid 
hybridization and clone identification are disclosed 
by Uaniatis, T. , et al. (In: Molecalar Cloning, A 
Laboratory Manual, Cold Spring Harbor Laboratories, 
Cold Spring Harbor, NY (1982)), and by Hames, et 
al., in: Nucleic Add Eybridlzatlon, A Practical 
Approach, IRL Press, Washington, DC (1985)), which 
references are herein incorporated by reference. 
Those members of the above-described gene library 
which are found to be capaible of such hybridization 
are then suialyzed to determine the extent and nature 
of the sequences which they contain. 

To facilitate the detection of the desired 
encoding sequence, the above-described DNA probe may 
be lcd3eled with a detectable group. Suc^ detectea>le 



wo 92/19744 



PCr/FI92/00138 



-19- 

group can be any material having a detectable physical 
or chemical property. Such materials have been well- 
developed in the field of nucleic acid hybridization 
and in general most any label useful in such methods 
can be applied to the present invention. Particularly 
useful are radioactive labels, such as ^^P, ^h, ^*C, ^^S, 
^^Ir or the like. Any radioactive label may be 
employed which provides for an adequate signal and has 
a sufficient half -life. The oligonucleotide may be 
radioactively labeled by laeans well-known in the art, 
for example, "nick-translation" and T4 DNA polymerase 
replacement synthesis. 

Alternatively, polynucleotides iare also useful as 
nucleic acid hybridization probes when labeled with a 
non-radioactive marker such as biotin, an enzyme or a 
fluorescent group. 

Thus, in summary, the actual identification of 
acid a-amylase peptide sequences permits the 
identification of a theoretical "most probable" DNA 
sequence, or a set of such sequences, capable of 
encoding such acid cr-amylase. By constructing am 
oligonucleotide complementary to this theoretical 
sequence (or by constructing a set of oligonucleotides 
complementary to the set of "most probable" oligo- 
nucleotides) , one obtains a DNA molecule (or set of 
DNA molecules), capable of ftmctiondLng as a probe (s) 
for the identification and isolation of clones 
directed to acid a-amylase. 

The above discussed methods are, therefore, 
capable of identifying genetic sequences' that are 
capable of encoding acid a-amylase, or fragments, 
variants and functional derivatives of acid a-amylase. 
In order to further characterize the cloned acid a- 
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amylase (or its fragment, variant or functional 
derivative) and especially, in order to produce 
recombinamt acid a-amylase, it is desirable to express 
the proteins which the sequences encodes. 

III. Expression of Proteins Using the Expression 

Vectors of the Invention 

Expression of acid a-amylase protein allows the 
identification of clones- that are capable of 
expressing the acid a-amylase protein of the 
invention, or fragments, variants or functional 
derivatives thereof.. Characteristics unique to acid a- 
amylase that may be used to identify the acid a- 
amylase protein, fragment, variant or functional 
derivative include the ability to specifically bind 
acid a-amylase antibodies, the ability to elicit the 
production of acid a-cunylcise antibodies that are 
capable of binding to the native protein, and the 
ability to provide an enzymatic function specific to 
acid a*-amylase, such as, for example, enzymatic 
stability at an acidic pH at which other "non-acidic" 
a-amylase enzymes axe, relatively unstable, among 
others. 

in a preferred embodiment, amylase-negative 
bacterial strsiins are used as hosts. Especially 
preferred are amylase negative mutants of Bacillus or 
JD. plantarum (which does not produce amylase by 
definition) . 

To eatress the acid a-amylase of the invention, 
transcriptional and translational signals recognizable 
by an appropriate host are necessary. Cloned 



sequences encoding acid a-amylase, its fragments, 
variants or functional derivatives, obtained through 
the methods described above, and preferably in a 
double-stranded form, may be operably linked to 
sequences controlling transcriptional expression in an 
expression vector. Such sequences may be introduced 
into a host cell to produce recombinant acid a- 
amylase, its fragments, variants and functional 
derivatives. 

In addition to transcriptional signals, it may 
also be advantageous to use translational and/or 
secretion signals derived f roiq other bacterial sources 
provided that such translational and secretion signals 
are functional in the chosen host cell. This is 
discussed further infra. 

According to the invention, any prokaryote host 
may be utilized. In a preferred embodiment, a member 
of the BacLllvLS or Lactobacillus genera are used as 
the host cell for expressing the desired protein of 
the invention. Such members include B. suhtilis, B. 
lichenijfomils, B. amyloliquefaciens^ B. polymyxa, B. 
steaxTothermophllus, B. theroproteolyticus, B. 
coagulan^f^ B. thuringiBnsis, B. isegaterium, B. cereus, 
B. natto, and, B. acidocaldarius » In an especially 
highly preferred embodiment, the host cell is 
Lactobacillus m 

Lactobacillus species that are associated with 
silage and are expecially preferred as hosts in the 
processes of the invention are L, plantarum, L. 
brevis, Ii. bucbnerie, L. coryniformis, L. cuxvatus, L. 
casei, i. termBntUBf L. acidophilus, and L. 
salivarius^ 
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A nucleic acid molecule, such as ONA, is said t:o 
be "capable of expressing" a a polypeptide like the 
acid a-amylase of the invention if the nucleic acid 
molecule contains expression control sequences that 
provide transcriptional regulatory information and 
such sequences are operably linked to the nucleotide 
sequence which encodes the acid a-amylase. 

The precise nature of the regulatory regions 
needed for gene ea^ression may vary between species or 
cell types, but shall in general include, as neces- 
sary, 5* rion-transcribing and 5« non-translating (non- 
coding) sequences involved with initiation of tran- 
scription and translation respectively. Especially, 
such 5' non-trcuiscribing control sequences will 
include a region which contains a promoter capable of 
functioning in the host cell. 

Expression of a recombinant protein in prokaryo- 
tic hosts recpiires the use of regulatory regions 
functional in such hosts, and preferably prokaryotic 
regulatory systems. A wide variety of transcrip- 
tional and translaliional regulatory, sequences can be 
employed, depending upon the nature of the prokaryotic 
host. Preferably, these regulatory signals are 
associated with a particular gene whicdi is capable of 
a high level of es^ression in the host cell. 

If desired, a fusion product of the desired 
protein may be constructed. For example, if the 
genetic sequence encoding a desired acid a-amylase (or 
its fragment, varicUit or fimctional derivative) does 
not possess a seQ[uence encoding a signal sec[uence 
ftmctional in a certain host, such signal secjuence may 
be operably linked to the desired genetic sequence, 
thus allowing secretion of the protein from, or the 
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membrane coropartmentalization of the protein in, the 
host cell. Such signal sequences may be designed with 
or without specific protease sites such that the 
signal peptide sequence is amenable to subsequent 
removal • Alternatively, the native signal sequence of 
a protein of the invention may be used. 

In addition to using either a native signal 
sequence or a signal derived from some other bacteria, 
it is often useful to construct, a signal sequence 
fusion sequence, utilizing only part of the native 
signal sequence. Potential fusion sites between two 
signal sequences are the hydrophobic regions or the 
helix breaker residue between the hydrophobic region 
and the C-terminal part of the signal seuqence. 
Examples of useful fusion signal sequences are 
described in Applicants' copending application, U.S. 
Application No. 07/377,450, filed July 10, 1989 and 
incorporated herein fully by reference*. 

Transcriptional initiation regulatory signals 
that can be operably linked to the proteins of the 
invention can be selected whicdi allow for repression 
or activation, so that expression of the operably 
linked genes can be modulated in a specific manner. 

Where the native expression control secpiences 
signals do not function satisfactorily in a desired 
host cell, then sequences functional in the host cell 
may be substituted. 

Once the vector or DNA sequence containing the 
construct (s) is prepared for expression, the DNA con- 
struct (s) is introduced into an appropriate host cell 
by any of a variety of suitable means known in the 
art. In a preferred embodiment, Lactobacillus are 
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transformed essentially as described in von Wright et 
al., Appl. Enciron. Microbiol. 55:2029-2035 (1990)- 

To transform a host cell with the DNA constructs 
of the invention many vector systems are available, 
depending upon whether it is desired to insert the 
genetic DNA construct into the host cell chromosomal 
DNA, or to allow it to exist in an extrachromosomal 
form, mien it is desired to maintain a vector in an 
extrachromosomal form the vector should provide an 
origin of replication capable of functioning in the 
host. 

Genetically stable transformants may also be 
constructed with vector systems, or transfoinnation 
systems, whereby a desired protein's DNA is integrated 
into the host chromosome • Such integration may occur 
da novo within the cell or, in a most preferred 
embodiment^ be assisted by tremsfonnation with a 
vector which functionally inserts itself into the host 
chromosome* For example, such vector may provide a 
DNA sequence element which promotes integration of DNA 
sequences in chromosomes. In a preferred embodiment, 
such DNA sequence element is a sequence homologous to 
a sequence present in the host chromosome such that 
the integration is targeted to the locus of the 
genomic sequence and targets integration at that locus 
in the host chromosome. 

Cells that have st2U3ly integrated the introduced 
DNA into their chromosomes are selected by also 
introducing one or more meorkers which allow for 
selection of host cells that contain the expression 
vector in the chromosome, for example the maurker may 
provide biocide resistance, e.g.^ resistance to 
antibiotics, or the like. The selectable msurker gene 
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can either be directly linked to the DNA gene 
sequences to be expressed, or introduced into the same 
cell by co-transformation. 

A sequence may also be incorporated into a 
plasmid or other vector capable of autonomous replica- 
tion in the recipient host. Any of a wide variety of 
vectors may be employed for this purpose. Factors 
of importance in selecting a particular vector 
include: the ease with which recipient cells that 
contain the vector may be recognized and selected from 
those recipient cells which do not contain the vector; 
the number of copies of the vector which are desired 
in a particular host; and whether it is desirable to 
be able to "shuttle" the vector between host cells of 
different species. 

In addition to antibiotic selection markers, one 
can (or should, if such antibiotic markers are 
tindesirable in a specific process in which the hosts 
will be utilized) use natural selection markers based 
on, for example, nisin resistance, thymidine 
synthesis, lactose utilization or X-prolyl-dipeptidyl- 
aminopeptidase (J^pl. Environ. Microbiol. 57:38-43 
(1991) . 

After the introduction of the vector, recipient 
cells are grown in a selective medium, which selects 
for the growth of vector-containing cells. Expression 
of the cloned gene sequence (s) results in the 
production of the desired protein. This expression 
can take place in a continuous memner in the 
transformed cells, or in a controlled manner. 

The expressed acid a-amylase, or fragment, 
variant - or fimctional derivative thereof may be 
isolated and purified in accordance with the method 
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described herein and using techniques known in the 
art. However, in a highly preferred embodiment, no 
isolation or purification of the product is needed as 
the host is directly inoculated into the process 
medium. 

The genetic sequences, protein sec^iences, vectors 
Euid methods of the invention are useful in many 
applications of the food and beverage industry. 
Especially, the hosts of the- invention are useful when 
directly inoculated into foodstuff. The lactic acid 
bacteria of the invention may be utilized for the 
preparation of fermented food and beverages as lactic 
acid bacteria play an essential role in the 
preparation of such fermented food and beverages. The 
LactobstcUll of the invention may also be applied in 
silage prepcuration and as probiotics in human and 
animal health. The hosts containing the sequences and 
methods taught herein, together with, if necessary, 
current knowledge of the genetics and biochemistry of 
lactic acid bacteria, especially of Lactococci and 
Lactobacillus, may be used for the construction of 
bacterial argicultural process strains that have been 
optimized for different agricultural and food 
preparation applications using the sequences and 
methods of the invention. The genetics of Lactic acid 
bacteria are known in the art and are reviewed, for 
example, in Biocbimie 70 :No. 3 and No. 4 (1988) and in 
FEMS Microbiol. Rev. 87:No. 1 and No. 2 (1990). 

For example, one optimization using the sequences 
and methods of the invention is the addition of 
heterologous acid a-amylase activities, and especially 
hosts expressing such activities, to any process 
strain. ' Highly desirable hosts that may be utilized 
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as process strain host microorganisms include B. 
subtilis, B. lichoniformis, B. amylolxquefaciens, B. 
polymyxa, B. stearothermophilus, B. 
thermoproteolyticus, B« coagulaiis^ B. thuringlensis, 
B* megateriuiD/ B. cereus^ B. iiatto, and, B* 
acidocaldarius . 

Especially the acid a-amylase activity of the 
invention would be valuable for these strains as 
lactic, acid bacteria are widely used for preservation 
of many starch containing raw materials (e.g., cereal 
grains, edible roots and crop residues) . 

Lactobacillus carrying and expressing the acid a- 
amylase gene of the invention, or a fragment, variant 
or functional derivative thereof, would be also 
extremely useful in food and fodder preparation. Such 
expression may occur under the control of the 
homologous (native) regulatory regions or under the 
control of heterologous promoter and/or signal 
sequences, especially those of Lactococcus or 
IractoJbacllIus origin. 

Additionally, the Lactobacillus hosts of the 
invention, csorrying the acid amylase of the invention 
arB useful for the preservation of feed and fodder 
material rich in starch, e.g., cereal grains (siliaged 
with crimping) , corn (maize) , maize cobs and alfalfa. 

Another specific application of these strains is 
their use in stcirter feeds for calves, in milk 
replacers for calves younger them four weeks (to 
increase digestibility of starch and for probiotic 
action) and in starter feed for piglets. 

Furthermore, these strains can be effectively 
used f or^ preservation of material containing cereal 
and slaughter scraps for fodder use. 
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The examples below sure for illtistrative purposes 
only and are not deemed to limit the scope ot the 
invention. 

EXAMPLES 

Example 1 

Isolation of the Acid ce-Amvlase Enzvme 
from Bacillus acidocaldarins 

Bacillus acidocaldarius (B. ac.) strain ATCC 
27009 was grown on agar plates containing (per liter) : 
1 g yeast extract^ 0.2 g (im^)^SQ^f 0.5 g MgS0^-7H20, 
0.25 g CaCl2*2B^O, 0.6 g K^PO^ and 2.5 g maltose^ pH 
4.5. To isolate the a-an^lase enzyme, the entire 

surface of the agar plate was inoculated with ac. 
cells and the plates were incubated 3-5 days at 55 ^C. 
The cultures were then suspended in 0.5 H sodium 
acetate, pH 5.0, and the cells were removed by 
centrifugation at 8000 g for 20 min. The supernatant 
was centrifuged again at 40,000 g for 30 min and 
proteins in 50 ml of the cleared growth medium were 
precipitated at 0«C by slow addition of (NH^jjSO^ to a 
final concentration of 70% (w/v) . After allowing the 
precipitate of the proteins to form for 30 min at 0«C, 
the precipitated proteins were collected by 
centrifagation at 10,000 g for 20 min. The pellet was 
dissolved in 3 ml of 20 mH BisTris, pH 5.8, and 
applied onto a Bio-Gel P-200 (Sigma, Richmond, USA) 
column (1.5 X 45 cm). The elution was carried out in 
the same buffer. Rapid screening for amylase activity 
in the eluted fraction was performed using plate 
assay. -The assay plates contained 1*5% agar, 0.2% 
starch, 20 nM CaCl2r 50 ibM sodium acetate, pH 5.0. 
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Samples were applied into wells made in agar and, 
after incubation for 2-10 hours at 55«c, the enzyme 
activity was detected by spreading 0,01 M I2/O.OIM KI 
solution onto the plates. The enzyme activity can be 
detected as a halo around the agar well. For 
quantitative assay of the acid a-amylase activity, the 
Phadebas® amylase test (Pharmacia) was used. 

The amylase containing fractions from Bio-Gel P- 
200 were pooled and concentrated by ultrafiltration in 
a Novacel l^-omegacell"* apparatus ( Piltron , 
Northborough, Massachusetts, USA)* The concentrate 
was rechromatographed in a Superose 12 HR 10/30 column 
(Pharmacia, Uppsala, Sweden) in 20 mM Bis-Tris, pH 
5.8. The amylase containing fractions were 

concentrated as above by ultrafiltration and stored at 
-20«C. 

To detect the amylase activity in SDS- 
polyacrylamide gels, zymography was used (Harris, H. 
et al.. Handbook of Enzyme Electro^orBsis in Human 
Genetics, North-Holland, Amsterdam, and. Methods in 
Enzymology 22:578 (1971). After SDS electrophoresis, 
the gel was washed for 15 min in 50 aM sodium acetate, 
pH 5.0 and put onto an amylase assay plate. After 
incubation at 55 for 10 hours the enzyme activity 
was detected as above. When the amylase (s) produced 
by B. ac. strain ATCC 27009, isolated and partially 
purified as described sO^ove, were analyzed by SDS-PAGE 
and zymography, two forms with relative moleculsa: 
weights of 90,000 and 160,000 were detected (Figure 
1) . Similar results were obtained when the SDS-PA6E 
preceding zymography was performed under nonreducing 
conditions. This suggests that the 160 kd form was 
not a disulfide linked dimer of the 90 kd form. Under 
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the conditions in zymography the 160 kd form had 
apparently much higher specific activity (Figure 1) . 

To assay the amylase activity after SDS-PAGE by 
zymography, the SDS-gel was first incubated in 20 mM 
CaCl2# 50 mM sodium acetate, pH 5.0 for 15 min at room 
temperature. AFter this incubation, the SDS-gel was 
placed on an agarose gel (3mm think, 1.0% agar, 0.2% 
starch, 20 mH CaCl^, 50 vM sodium acetate, pH 5.0} . The 
gels were then incubated at 55 oc for 10 hours. To 
reveal the amylase activity, the agarose gel was 
stained by spreading O.OIM I2/O.OIH KI solution onto 
the gel* A clear halo in the dark background 
indicated the amylase activity. 

In chromatofocusing (Huchthens, T.W« in: Protein 
Purification, J.C- JcUison et al., eds., V.H.C. 
Publishers Inc., New York (1989)) and isoelectric 
focusing (der Lsui et al. , in: Gel Electrophoresis of 
Proteins f B.D. Hames et al.,. eds., IRL Press, Oxford) 
followed by zymography both the 160 kd and 90 kd 
polypeptides migrated as a broad peak and band, 
respectively, at pi 4.8« When the pH optimum of the 
enzyme (s) was meaisured, a symmetric curve with a 
relatively sharp pH optimum at pH 5 was obtained. The 
enzyme activil^ was about four fold at 60^C to that at 
370c (Figure 2) . The above data suggests that the 90 
kd polypeptide might be a degradation product of the 
160 kd protein. 
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Example 2 

N"Terminus Sequencing of t:he Acid g-Amvlase Gene 

To secpience the acid a-amylase enzyme, SDS-PAGE 
was performed to separate the 90 kd and 160 kd 
polypeptides from the partially purified culture 
supernatant. After SDS-PAGE the gel was treated with 
1 M KCl to visualize protein bands and the band 
corresponding the enzyme activity was- excised* From 
the gel the protein was electroeluted iising an ISOO 
model 1750 electrophoretic concentrator as described 
in Advanced Methods in Protein Microsequence Analysis, 
Wittmaxm-Liebold, B., Salnikow, J. and Erdmam, V.A. , 
eds., pp, 194-206, 1986, Springer-Verlag Berlin 
Heidelberg. Only the 90 kd polypeptide could be 
purified in an amount required for NH2-terminal 
sequencing. For NI^«-terminal amino acid sequence 
analysis acid amylase was degraded in a gas/pulsed 
liquid sequencer («7. Prot. Chem. 7:242-273 (1988)) 
after electrophoretic transfer onto a polyvinylidene 
difluoride membrane Speibher, D.W. in:atechnigues in 
Protein Chemistry, T.E.Hugli, ed.. Academic Press., 
San Diego, CA (1989) « fOien the 90 kd protein 
electrotransferred onto a polyvinyl difluoride 
membreme was degraded, a single amino acid sequence of 
NH^-Asp-Ile-Asn-Asp-Tyr was obtained. 

For* the cloning of the cozrresponding acid a- 
amylase gene, eight lysylendopeptidase-cleaved 
peptides of the 90 kd protein were purified as 
follows: After electroelution of the 90 kd protein 
from the gel, the eluate was freeze dried and the 
solid material was redissolved in 50 /il of 50 mH 
Tris/HClV pH 9.0. 150 ng at lysylendopeptidase (Wako, 
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Dallas, USA) was added and the mixture was incubated 
at 30 for 18 hours. The resulting peptides were 
sepaurated by reverse phase chromatography on a Vydack 
218 TPB5 (0.46 X 15 cm) column connected to a Varian 
5000 liquid chromatograph. The peptides were eluted 
using a linear gradient of acetonitrile (0-60% in 90 
min) in 0.1% trif luoroacetic acid. The peptides were 
sequenced after application on polybrene (2mg) 
pretreated glass fiber filters. The M-terminal 
sequence of the eic^t purified peptides derived from 
90 kd. protein are shown in Table 1. Three of the 
eight N-terminal sequences (Table 1 no. 2, 3 and 4) 
were used to synthetize oligonucleotides. The 
sequences of these oligonucleotides, used for 
screening of the ac.-gene library, are shown in 
Table, 2. 



TABLE 1 

N-terminal sequences of the lysylendopeptidase 
cleaved peptides of the 90 led protein. 

Peptide Sequence 

1 tyr-aBp-thr-ala-asp-tyr-phe-lys (SEQ ID Ko. 6] 

2 iea--val-8er-a8p-phe--gln-phe-5er^he-pro-^-a8p-pro-thr--il 

[SBQ ID No. 71 . , , , , 

3 iie-aBp-pro-gly-phe-gly-thar-glxi-gln^asp-^ 

[SEQ 3± Ho* 8] 

4 his-ala-val-ile--t:yr-^lu-lle-«et-ipi»-aflp-p 

Ue-ala (SES ID No. 9] 

5 lett-asp-tyr-Ieu-lys [SEQ ID No, 10) ^ , , r««/% 

6 BeE^leu-gly-Tal-a8a-tlir-leuH:yr**leu-met-pix>-v^al-^ne-gta 

ID No. 11] 

7 phe-gly^aBn-phe-hio-fler-aon-gly [SBQ ID No. J2i 

8 gly-lle-tyr-val-gly-ala [SBQ ID No. 13 J 
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Table 2 



Oligonucleotides used for screening of the B«ac, 
- a-amylase gene. Oligos 370, 371 and 372 
coirrespond to peptides 2, 3 and A, respectively. 
"I" is inosine. 



Oligonucleotide 



Sequence 



370 



5* TA GAA GAT IGT IGG GTC III IGG GAA IGA GAA- 
CTG GAA GTC IGA lAC 3* [SEQ ID No. 3] 



371 



5' CA GTC CTG CTG IGT ICC GAA ICC IGG GTC GAT 3* 
[SEQ ID No. 4] 



372 



5» GC lAT GTT ICC GTT ITA GAA IGG GTC IGG CAT 
lAT CTC ITA lAT lAC IGC GT6 3« [SEQ ID No. 5] 



Example 3 

Cloning of the B. acidocaldarius Acid a-Amylase Gene 

ATCG 27009 B. ac. cells were grown in a liquid medium 
containing [per liter] Ig yeast eactract, 0.2g (NH^)2S0^, 0.5 
g HgS0^.7^0, 0«2S g CaCljr 0.6g TSO^O^ and Ig glucose, pH 4*5 
at 55 ^C until late logarithmic phase. The cells were 
collected and the chromosomal DNA was isolated according to 
Marmur, J. Mol; Biol. 3: 208-218 (1986)), except that the 
DNA was dissolved in saline after £irst ^thanol 
precipitation. The chromosomal DNA was pcurtially digested 
with Faelll and after agarose gel electrophoresis a 
fragment .population of 4-6 kb was isolated. The DNA 
fragments were cloned in £• coli using i(-gtlO as the vector 
(Amersham) according to manufacturers recommendation. The 
ac. gene bank in E. coli was screened by plague 
hybridization using oligonucleotides 370 [SEQ ID No. 3]; 
371 [SEQ ID No. 4] and 372 [SEQ ID No. 5] (see table 2} .as 
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radioactive probes. After hybridization a positive clone, 
witb an 4.2 kb insert, was found. The insert was subcloned 
in M13 vectors and pBR322 and sequenced either by M13 
method or using direct plasmid sequencing method. 

The DMA sequence revealed an open reading frame of 
3532 nucleotides starting from the 5* end of the 4.2 Jcb 
insert (see Figure 3 and [SEQ ID. No. 1)) . Within the open 
reading frame, starting from the codon 342, a deduced amino 
acid sequence DXHDY (see Figure 3) can be found. This 
sequence corresponds to the M-terminaJL amino acid sequence 
of the 90 kd protein. The open reading frame downstream of 
codon 342 codes for a 92 kd protein which is in good 
agreement with the purified H^90 led band. Codon 342 is 
preceded by an open reading frame at the 5* end of the 4.2 
kb clone, and there are no sequences resembling either 
promoter or SD-regions in the near vicinity of the N- 
^ft-rmiTiai codon (codon 342). This strongly suggests that 
the 90 kd protein indeed is a breakdown product of a larger 
polypeptide and the regulatory regions of this polypeptide 
are outside of the 4.2 kb insert. Therefore, the B.ac. 
gene bank was rescreened using a 5* fragment of the 4.2 kb 
clone as a probe. This resulted in a new 5.1 kb clone 
which partially overlapped end of 4.2 kb fragment. The 
organization of these two clones was confirmed by Southern 
blotting from the B.ac.-genome and the joint region was 
sequenced also directly from a relevant chromosomal PCR- 
fragment. The 5.1 Ida clone was sequenced 1^ M13 method. 

DMA sequence upstream of the 4.2 kb fragment 
revealed that open reading frame of the 4.2 kb fragment 
continued additional 400 codons. The open reading was 
preceded by a laical promoter - SD-region, and the deduced 
N-termineJ. amino acid sequence showed typical features of 
the bacterid signsa peptide (see Figure 3). The combined 
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open reading frame codes for a protein of 150 kd which is 
in good agreement with the 160 kd band shown in the SDS- 
PAGE - zymography. The deduced amino acid sequence 
revealed sequences that are typical to other amylotic 
enzymes and surprisingly^ a C-terminus that resembles a 
hydrophobic anchor sequence. 

To combine the two parts of the amylase gene the 
relevant DNA fragments of the 5.1 kb and 4.2 kb clones have 
been ligated in to a pHP13 plasmid vector (Hoi. Gen. 
Genetics 209:335-342 (1987)) and transformed in Bacillus 
stibtilis. The DNA analysis of the acid amylase/pHP13 
hybrid plasmid confirmed the intact acid amylase gene was 
stably maintained in JB. siibtllis. This hybrid plasmid was 
designated pKTH2034. 

Example 4 

Express ion of the Acid a-Amvlase Gene in Lactobacillus 
Using the clone described herein, the B.ac* a-amylzuse 
gene is e3q>ressed in JDactoJbacillus host cells. The method 
is described in detail in Examples V, VI, VII and X, of the 
Applicants* copending Application Serial No. 377,450, filed 
July 10, 1989, the specification of which is incorporated 
herein by reference in its entirety as if at forth in full. 

Briefly, plasmids as exemplified by pKTH 1797, pKTH 
1798, pKTH 1799, pKTH 1801, pKTH 1805, pKTH 1806, pKTH 1807 
and pKTH 1809 contain promoter and secretion promoting 
signals. The promoter and secretion promoting sequences of 
these plasmids cam direct the expression of heterologous 
gene and secretion of gene product Gram positive host cells 
such as Bacillus, Lactobacillus and Jtoctococcus^- 

To obtain expression of the acid a-as^lase gene and 
secretion * or "^eacport of the gene product, a host cell such 
as Laictobaclllus • ±e transformed with a plasmid comprising 
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tlie promo-ter and secre'tion proinot:ing sequences described 
above and tbe gene coding for the nature acid amylase 
protein described herein. In addition of using a plasmid 
that is able to replicate in Ifactobacillus host, a 
nonreplicative plasxaid (e.g., pBR322 or pEl94) , bearing 
random fragments of Lactobacillus chromosome, can be used. 
Transformation with nonreplicative plasmid results in 
integration of the acid amylase gene in the Lactobacillus 
chromosome* 

Xhe host containing the amylase gene is cultured in a 
suitable medixna allowing esepression of the protein and the 
protein is recovered from the cell wall layer or culture 
medium or the host cell, producing the a-amylase activity, 
is directly used in the required process. The Laclxybacillus 
host of the invention, modified to express the acid=^- 
an^lase gene, may be utilized for the processing of food 
and feed as in the following examples. 

Example 5 
Proems j,nq of foyaqe 

Forage, for instsmce grass, is harvested by either a 
frail harvester or a precision chopper attached to a wagon 
of a tractor. Preseinration solution is added to the 
chopped forage mass during the harvesting in the 
chopper/harvester using equipment designed for this 
purpose. Th preservation solution is^prepared by adding an 
inoculant (a Lactobacillus strain as described above, 
capable of producing the acid amylase of the invention) , to 
a bottle or tauik which contains half of its - volume as 
lukewarm water. This water should not be so hot as to kill 
the bacterial After shaking to evenly distrJLbute the 
inoculant, the rest of the volume of the bottle is filled 
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by lukevarm water. Usually fanners use bottles of 30 
liters or tanks of 200 liters. The inoculant can be either 
in liquid form or as a dried powder of the acid amylase- 
expressing hosts of the invention. The inoculant may be 
prepaored in fermentors using whey-based medium and the 
culture is grown to a cell density of approximately 10^ 
cells/ml. Aflzer the culture has reached this density, the 
cells are collected by centrifugation and lyophilized to a 
final density of 10"-10" cells/^ of dry weight. Twenty 
grams of this lyophilized material or equal amount of cells 
in a liquid concentrate is mixed with 30 liters of water as 
described above. Five liters of this diluted solution is 
applied per ton of forage. 

Alternatively, the prepared preservation solution can 
also be added to the forage mass after the wagon has been 
emptied in the silo in which fodder is ensiled. 

Example 6 
Animal Feed Additive 

A concentrate mixture (powdered, granulated or 
pelleted) is manufactured xxsing standard manufacturing 
systems. Each con^onent in the mixture is separately 
weighed and then mixed together before pelleting, for 
instance. An inocul€mt (as described in Example 5) can be 
added or mixed into the mixture as one of the components. 
Ihe concentrate. Including the inoculant, can be fed to 
domestic animals, such as, for example, cows, calves, pigs, 
piglets, etc.'-^ 
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The inoculant can also be fed separately as a 
specially formulated powder, tablet or a liquid product. 
The inoculant is generally included in a carrier 
ingredient, for instance, skim miUc powder. These products 
can be fed to domestic animals directly, as as a component 
together with the other feeds or in the drinking water of 
the animal. 

For each purpose,, an inoculant is manuactured so that 
it contains lo"-10^^ cfu/g dry inoculant powder. The 
inoculant may be used in any concentration which is 
necessary to achieve the desired effect. Especially, when 
used as an additive to the zuiimalsV feed, the inoculant is. 
used at 0.01^0 .02% in the concentrate mixture. 

The advantage of feeding the inoculant of the 
invention to domesticated farm animals such sis the above is 
that by letting the hosts of the invention colonize the 
intestional tract of the animal, the animal is better able 
to digest the silage or fodder that it is fed, as the acid 
stable amylase secreted by such hosts would better degrade 
the ingested foodstuff. 

Tin references cited herein are incorporated herein by 
reference • While this invention has been described in 
detaJ.1 smd with reference to specif ic embodiments thereof, 
it will be apparent to one skilled in the art that various 
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changes and modifications could be made therein without 
departing from the spirit and scope thereof. 
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SEQUENCE I.ISTING 

(1) GENBRAX. IMFORHATIOHs 

(i) APPItlCANTs Teija KoivuLar 

Mervi Sibakov, and 
Illdca Palva 

(ii) TITLE OF INVENTION: Acid a-Amylase 
(ili) NDHBER OF SEQUENCES: 13 

(iv) CORRESPONDENCE lUJDRESS: . ^ „ 

(A) ADDRESSEE! Sterne, Kessler, Goldsteui & Fox 
IB) STREET: 1225 Connecticut Avenue, N.W.^ 
suite 300 

(C) CITY: Washington 

(D) STATES District of Columbia 

(E) OOUNTRYs United States of America 

(F) ZIPS 20036 

(V) COMPUTER READABLE FORM; 

(A) HEDIUK T3fPEs Floppy. Disk 

(B) CQMFUTBR: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARES Ascii 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 

(C) CLASSIFICATION: 

(vli) ATTORNEr/AGENT INFOIWAriON: 

(A) NAME: Uicfaele Ann Cixiibala 

(B) REGISTRATION NUMBER: 33,851 

(C) REFERENCE/DOCKET NUMBER: 1155,0090000 

(viii) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONES (202) 833-7533 

(2) INFORMATION FOR SEQ ID NO sls — 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4140 bases ^ 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESSs Double Stranded 

(D ) TOPOIXX5Y s Linear 

(li) MOLECULAR TYPE: Nucleic Acid 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 

CATGTGGGTC GCCTCCTCTG CGATGGGACA TCTCACTTAC ATTTTAACAG TGGAAGGGTG 60 

GGGAAGGGAA CAAGGGTCTT GAGAGAAAAG AAAGGATTCG GACAATGAAC GATGCAAGAG 120 

CACCATTGAC GT0GGGAAG6 CTGAGCGTAT AATCCAGGAC GACAAOGTTA GTGCAATCGT 180 

TTGCACAAGA GGA6GTT6AT GCACGT6AAG TGGACGGGTG CC6CGAGCCT TGCGACGACC 240 
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ATCGTCATGT CTGCAGCGCT TCCGTTCGCG GCGTTTGAAA ACACCCGTTC 0GTCGGCA06 300 

GAACGOGTXC OX^GCGGCGAC OGCGG06GAT AGTACTAACG CCTGGCCCCC GACGGGGG06 . 360 

TCCGCAGGOG GTGGCACAGC AGGAGAGACC TACTOGAATG OCGOIGGAGAT TGTGA606GC 420 

AATGGCTTTC 60GCTGGAAA CGGGGOCAOG GTCGTCCTCG CTGOXSAATGG ACTGAACTTG 480 

AACCXCTOGA GTTTCAACGT TGTGGTATOG AATAGCCTCA AOGGOGTGAT C6ACGTCACA 540 

ACTGATTOOG TGCTAACGGG GCCGAATTCG ATTGCCTTCC ACCTTCCTCC CGGGGATGAC 600 

GGGCTTGGCG GOG6TAOGTA GACGGTCACC GTOGAAAGTC GAGGTGATTC GGTCTOGACG 660 

CCTTCAGGAC AGGGACTTCA GGTCATGCCT TATAOGACGG CCGACACCAT TCAGTGGCAT 720 

GGAATCTACA CATCCGATGG CGOGATGTAC GTGTCAGATC CGAATOOCTC CCCTGGACAA 780 

GAGGT6AOGA TCAGCCTTCG AG0GTACA6C GGGAACCTGA C96AAG6TCAT CCTGAATTGC 840 

TGGGACACXX? CACAAAACAA GGGTTTOCAG GTCGAGATGT CTCC!AGGCAG AAOGTTTGGA 900 

COGTATGAGC TCTGGTCTGC GA06ATCCCC GCTTCAAAOG GOGGAAOGAT CTATTATCGC 960 

TTTGACATAT AC6ATGGCAC CAGrTTTGOG TGTCTCTCAG GTGA066ACT GCACACGTCT 1020 

GAOGACATCA ACAACAATTT CCC6TTGCCC GTGGGGAOGG TCaOGCTTTC GACACTTCAG 1080 

GCGAATOCCG GTGATAOGGT GAOGGTCTCC GACCCTCTAG GTGACTTOGC OGGAAGCCAG 1140 

GATCAAOCCA ATCACAGGGT GATAOOGTTT GTCAACTOGT OGGGCXUUUU: GGCOGCCACA 1200 

GTCAATGGGA OG2VAOGCGAG CTGGAACA6C GTGCA6TTCA GAGTCCCACA GAGCXTTTCCA 1260 

AhCGGCTTGT AT0GC!G70GA GAT06ACA0G GTOGCCAAGG ACG0GGATC6 GGTGGTCAAT 1320 

GTOGAATTGG ACAGGAGTCC GGAGCTTATT GTAGGGOCTC TC0C0G06T6 GATGCAAG06 1380 

TATGCACATG ATTCGTTTCA GGOGTTCTAC CX3AT0G0CTT T0GGAGCC6T GTCCACAGGA 1440 

ACOCCXiATCA CGCTTCGOCT GOGGGCTCOG CTCAGOGTGA AGAGIGOGAC GCTTCGCeTC 1500 

T6GGGGGGA6 OGGATGAGTC AGGGGAiGATC GACCTGCOGA TGGAGAAGCT CCAAATGTOG 1560 

GGAGAOGAGT TGGCSGGAACA AAOOGGCGTG CAGGACaiTCA ACGACTACAC GTGGtGGACG 1620 

6TGACCATCC CTGOGGCX^GA TGTGACCACA CCCGGGACGA TGTGGTATCA GTT0GTGA06 1680 

6AGA0GGACA CTGGCGAGGT GGTCTACTAC GATGACAATG GAGCTCAGCT TCAAGGGOCT 1740 

GGCXSAGGTTG GGTTGTCTTC CGACGGACCG AGCTACCAGA TCAGOGXATA CGAAOGGGGA 1800 

TTTCAGACGC CAGATTGGCT GAAGCACX5CC GTGATCTACG AAATCATGCC GGATCGGTTC 1860 

TACAATGGCA ATATOGCCAC GGAGGA6AAT CCX5AATACGC AAAAGGGGAT TTATGTAGGG 1920 

GCCX5AT6GAA CGGAGTCATT AGGCCCCATC CAGTTCCACG AGAACTGGGA CTCGCCGCCC 1980 

TATGATCOGA ATATTCCTCC GTTATCTGAT CCCAAAATTG CCAGTCTGCG AGGCAATGGC 2040 
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TTGACTTTTT CGGAGGTGAT TTGCAGGGCA TCGAGGATAA GCTGGACTAC 2100 

CTGAAGAGC3C TTGGAGTCAA TAOGCTGTAT CTGATGCXXC TCTTTGAGGC GGAATCCAAT 2160 

CaCAAATATG ACACMOCGA CTATTTCAAG ATTGACCCTG GATTTCGAAC GCAGCAGGAC 2220 

TGGCTGAATC TOGTACAGGC TGCGCAOGCG AAGGGGTTCC ATATCATTCT CGAOGGGGTG 2280 

TTC3GAAGATA C0GGAA6TOA CaWSCGTATAT TTCAACAAGT TCGGGAACTX CCACTCCAAC 2340 

GGTGOGXGGC AGGGGXACCT GAAGAACCAG CCGTCGCTGT CGCCCTACTA CXOGTGCTAC 2400 

6TCTG6ACA6 G6AACACCTC AAAOCCATAC GATT06TGGT TTCAGATCGA CAOGCTGCCA 2460 

CTTACGGACA 0GT06AAC0C CGCCTATCAC CGATTCGTGT ATGG6AGOGA CAACTCAGTC 2520 

G0G06TCTGT G6ATC0GGGA AGGTCOGGAC G6ATCGCGCT TGGACTCG6C C9GACAAOGGG 2580 

AATTTCAACA OGGCATGGTG GGGTGGCTTr CX^CAGGCCG TGAAATOGAT 06ATC3CCAAC 2640 

GCA60GATCA XOGGC3GAGAT CTGGGACftAT GCGAC6AATG ACAAT6GAAC GGATTGGTTG 2700 

ACGGGAT06A CCTTOGACAG TGTAATGAAC TACCAGTTCC GGAACGOCGT GATC3GACTTC 2760 

TTOOGOGGCA OGTA<aAC3GA CGGAAACGTG CAGCACCAC6 CCGTCGAOGC T60GGGATTC 2820 

AACGAGGAAC TGATG06CCT 6TACA6TGAA TATCCTCTGC AGTOGTTCTA CTOGATGATC 2880 

AACCTTGTOG ATTCGCAAGA CACCATGGGG ATCCT6ACCA TCTTGGAGAA 060GC0GCAG 2940 

OCaGGCCATC TMOOGC6CP OCftGCaGGAX 6AGTACAG6C OGTCXCCTGC GGCIGAACAG 3000 

TTGGG^TCG AGAG6CXGAA GCTTGTATCG 6ACTTTCAAT TCAGCTTOOC GGGOGATCCG 3060 

ACCATCTTCX ACS6G06A06A G6GAGGGCIC ACTGGTTATT 06GATCC0CT CAAT06T06G 3120 

ACCTATCOGT GGGACAAOa GAATCTC5GAT CTCCIGAACC ACXACCGCAA GCT06GGGCC 3180 

ATTOGAAACG CXAATCCTGT GCTTCAGACG GGGGATTTCA CGC06TTGTA OGCACAGGGC 3240 

ATGGTGTACG CATTTGCAAG GACX31TTCGG AATGGGOGAG ATGTCTTOGG TCTGCCAGGG 3300 

GAGGATGCCA 06GCCATTGT GGCGATCAAC AATCAGAACC AAGCCATCSIC OGTGAGCATT 3360 

OCGACGGAT6 GGACGGTTGC GGAOGGGTCC ACGATGCTCG AIGAACZGAA CAACCA6T66 3420 

TACAA6GTGC AGAATGGTGG CATTACACTC ACGCTGCAAT C6TATCAAG6 TGCCaTTTTG 3480 

GTGAOGC06A GC6AC6CGCC GATGGCTTAT CTGCAAGAGG AGGATTCTCA 6AAC56A6ATT 3540 

GOGTGGA06C CTGT6CAAGG TGCX31TOGGT TATOGCGTCT GGA6ACAGAA TCCGAATGGA 3600 

CAATGGGTGC CXrrTTGGACC TGTGCXTCCT GCCACGGACT TGA6TGTCAC GGXGGAACGC 3660 

(MGCarMG OGCaAACGTT TGCTGTACAA GCGCTGTTTT CGG06TCTGA TCACX3CCCAG 3720 

TCTCC3GCTGT OG6CACCTAA GACGGTATCG CTTCCCGTCG ATGTGCC06C GGTACGCCTG 3780 

AGTCftGCOGA TTCTTAGTGG TCGTGTG6TT GGAGATC6TG CGATGGTCTC GATCACGCCG 3840 
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GTTTCAGGG6 OGAOGCA6TA TGTGATCTAC CAGAGACAGG GCXSAOGGATC GTATGCTCOG 3900 

GTCG06A0G6 TCTCCACAAG TGGCGATTGC GCAGCTATAG GGGAAGTTCC TGCGCAAG6T 3960 

C0GG€X:AACT OGCCTCAOGC GACGATTOGC GXGACAGTCC COGTACCTGC AGGTTTCTCG 4020 

TOGGTCACCT ACOGCGTGGC TGCGCAAAAC GAAGATGGGC AAGC7GTGAC CAATCGATTG 4080 

ACCXnrATCGC TCTCGAAAAA GT6ATGCCTC GCGAAAAGCG CTAT06GAAT TTTTTOGAGA 4140 

(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE Ca^OlACTERISTICS : 

(A) LENGTH: 1341 amino acids 

(B) TYPE: Amino Acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: Linear 

(ii) MOLECULAR TYPE: 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2s 

Met His Val Lye Trp Thr Gly Ala Ala Ser Leu Ala Thr Thr He Val 
15 10 15 

Met Ser Ala Ala Leu Pro Phe Ala Ala Phe Glu Asn Thr Arg Ser Val 
20 25 30 

Gly Thr Glu Arg Val Leu Ala Ala Thr Ala Ala Asp Ser Thr Asn Ala 
35 40 45 

Ser Pro Pro Thr Gly Ala Ser Ala Gly Gly Gly Thr Ala Gly Glu Thr 
50 55 60 

Tyr Ser Asn Ala Val Gin He Val Ser Gly Asn Gly Phe Arg Ala Gly 
65 70 75 80 

Asn Gly Ala Thr Val Val Leu Ala Val Asn Gly Leu Asn Leu Asn Pro 
85 90 95 

Ser Ser Phe Lye Val Val Val Ser Asn Ser Leu Asn Gly Val He Asp 
100 105 110 

Val Thr Ser Asp Ser Val Leu Thr Gly Pro Asn Ser He Ala Leu His 
115 120 125 

Leu Pro Ala Gly Asp Asp Gly Leu Gly Ala Gly Thr Tyr Thr Val Thr 
130 135 140 

Val Glu Ser Gly Gly Asp Ser Val Ser Thr Pro ser Gly Gin Gly Leu 
145 150 155 160 

Gin Val Met Pro Tyr Thr Thr Ala Asp Thr He Gin Trp Asp Gly He 
165 170 175 
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Tyr Tfar Ser Asp Gly Ala Met Tyr Val Ser Asp Pro Asn Pro Ser Pro 
180 185 190 

Gly 61n Glu Val Thr lie Ser Leu Arg Ala Tyr Ser Gly Asn Leu Thr 
195 200 205 

Lys Val lie Leu Asn Cys Trp Asp Thr Ala Gin Asn Lys Gly Phe Gin 
210 215 220 

Val Glu Met Ser Pro Gly Arg Thr Phe Gly Pro Tyr Gin Leu Trp Ser 
225 230 235 240 

Ala Tfar lie Pro Ala Ser Asn Gly Gly Thr lie Tyr Tyr Arg Phe Asp 
245 250 255 

He Tyr Asp Gly Thr Ser Phe Ala Cys Leu Ser Gly Asp Gly Leu Hia 
260 265 270 

Thr Ser Asp Asp lie Asn Asn Asn Phe Pro Leu Pro Val Gly Thr Val 
275 280 285 

Thr Leu Ser Thr Leu Gin Ala Asn Pro Gly Asp Thr Val Thr Val Ser 
290 295 300 

Asp Pro Val Gly Asp Phe Ala Gly Ser Gin Asp Gin Pro Asn His Thr 
305 310 315 320 

Val He Arg Phe Val Asn Ser Ser Gly Glu Thr Ala Ala Thr Val Asn 

325 330 335 

Gly Thr Asn Ala Ser Trp Asn Ser Val Gin Phe Thr Val Pro Gin Ser 
340 345 350 

Lea Pro Asn Gly Leu Tyr Arg Val Glu He Asp Thr Val Ala I.y8 Asp 
355 360 365 

Ala Asp Gly Val Val Asn Val Glu Leu Asp Arg Ser Ala Glu Leu He 
370 375 380 

Val Gly Pro Leu Pro Ala Trp Met Gin Ala Tyr Ala His Asp Ser Phe 
385 390 395 400 

Gin Ala Phe Tyr Arg Ser Pro Phe Gly Ala Val Ser Thr Gly Thr Pro 
405 410 415 

He Thr Leu Arg Leu Arg Ala Pro Leu Ser Val Lys Sc^r Ala Thr Leu 
420 425 430 

Arg Leu Trp Gly Ala Ala Asp Gin Ser Gly Glu He Asp Leu Pro Met 
435 440 445 

Gin Lys Leu Gin Me^ Ser Gly Asp Glu Leu Ala Gin Gin Thr Gly Val 
450 455 460 

Gin Asp He Asn Asp Tyr Thr Trp Trp Thr Val Thr He Pro Ala Ala 
465 470 475 480 

Asp Val Thr Thr Pro Gly Thr Met Trp Tyr Gin Phe Val Thr Glu Thr 
485 490 495 
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Asp Thr Gly GXn Val Val Tyr Tyr Asp Asp Asn Gly Ala Gin Leu Glu 
500 505 510 

Gly Pro Gly Gin Val Gly Leu Ser Ser Asp Gly Pro Ser Tyr Gin lie 
515 520 525 

Ser Val Tyr Glu Arg Gly Phe Gin Thr Pro Asp Trp Leu Lys His Ala 

530 535 540 

Val He Tyr Glu He Met Pro Asp Arg Phe Tyr Asn Gly Asn He Ala 
545 550 555 560 

Thr Glu Glu Asn Pro Asn Thr Gin Lys Gly He Tyr Val Gly Ala Asp 
565 570 575 

Gly Thr Glu Ser Leu Gly Pro He Gin Phe His Glu Asn Trp Asp Ser 
580 585 590 

Pro Pro Tyr Asp Pro Asn He Pro Pro Leu Ser Asp Pro Lys He Ala 

595 600 60S 

Ser Leu Arg Gly Asn Gly Gin Trp Asn He Asp Phe Phe Gly Gly Asp 
610 615 620 

Leu Gin Gly He Glu Asp Lys Leu Asp Tyr Leu Lys Ser Leu Gly Val 
625 630 635 640 

Asn Thr Leu Tyr Leu Met Pro Val Phe Glu Ala Glu Ser Asn His Lys 
645 650 655 

Tyr Asp Thr Ala Asp Tyr Phe Lys He Asp Pro Gly Phe Gly Thr Gin 

660 665 670 

Gin Asp Trp Leu Asn Leu Val Gin Ala Ala His Ala Lys Gly Phe His 
675 680 685 

He He Leu. Asp Gly Val Phe Glu Asp Thr Gly Ser Asp Ser Val Tyr 
690 695 700 

Phe Asn Lys Phe Gly Asn Phe His Ser Asn Gly Ala Trp Gin Ala Tyr 
705 710 715 720 

Leu Lys Asn Gin Pro Ser Leu Ser Pro Tyr Tyr Ser Trp Tyr Val Trp 
725 730 735 

Thr Gly Asn Thr Ser Asn Pro Tyr Asp Ser Trp Phe Gin He Asp Thr 
740 745 750 

Leu Pro Leu Thr Asp Thr Ser Asn Pro Ala Tyr Gin Arg Phe Val Tyr 
755 760 765 

Gly Ser Asp Asn Ser Val Ala Arg Val Trp He Arg Glu Gly Ala Asp 
770 775 780 

Gly Trp Arg Leu Asp Ser Ala Asp Asn Gly Asn Phe Asn Thr Ala Trp 
785 790 795 800 



Trp Gly Gly Phe Arg Gin Ala Val Lys Ser He Asp Pro Asn Ala Ala 
805 810 815 



f 
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lie He Gly Glu He Trp Asp Asn Ala Thr Asn Asp Asn Gly Thr Asp 
820 825 830 

Trp Leu Thr Gly Ser Thr Phe Asp Ser Val Met Asn Tyr Gin Phe Arg 
835 840 845 

Asn Ala Val He Asp Phe Phe Arg Gly Thr Tyr Asn Asp Gly Asn Val 
850 855 860 

Gin His Hie Ala Val Asp Ala Ala Gly Phe Asn Gin Glu Leu Met Arg 
865 870 875 880 

Leu Tyr Ser Glu Tyr Pro Leu Gin Ser Phe Tyr Ser Met Met Asn Leu 
885 890 895 

Val Asp Ser Gin Asp Thr Met Arg He Leu Thr He Leu Glu Asn Ala 
900 905 910 

Pro Gin Pro Gly Asp Leu Ser Ala Leu Gin Gin Asp Glu Tyr Arg Pro 
915 920 925 

Ser Pro Ala Ala Glu Gin Leu Gly He Glu Arg Leu Lys Leu Val Ser 
930 935 940 

Asp Phe Gin Phe Ser Phe Pro Gly Asp Pro Thr He Phe Tyr Gly Asp 
945 950 955 960 

Glu Ala Gly Leu Thr Gly Tyr Ser Asp Pro Leu Asn Arg Arg Thr Tyr 
965 970 975 

Pro Trp Asp Asn Gin Asn Leu Asp Leu Leu Asn His Tyr Arg Lys Leu 
980 985 990 

Gly Ala He Arg Asn Ala Asn Pro Val Leu Gin Thr Gly Asp Phe Thr 
995 1000 1005 

Pro Leu Tyr Ala Gin Gly Met Val Tyr Ala Phe Ala Arg Thr He Arg 
1010 1015 1020 

Asn Gly Arg Asp Val Phe Gly Val Pro Ala Glu Asp Ala Thr Ala He 
1025 1030 1035 1040 

Val Ala He Asn Asn Gin Asn Gin Ala He Thr Val Thr He Pro Thr 
1045 1050 1055 

Asp Gly Thr Val Ala Asp Gly Ser Thr Met Leu Asp Glu Leu Asn Asn 
1060 1065 1070 

Gin Trp Tyr Lys Val Gin Asn Gly Gly He Thr Leu Thr Leu Gin Ser 
1075 1080 1085 

Tyr Gin Gly Ala He Leu Val Thr Pro Ser Asp Ala Pro Met Ala Tyr 
1090 1095 1100 

Leu Gin Glu Glu Asp Ser Gin Asn Glu He Ala Trp Thr Pro Val Gin 
H05 1110 1115 1120 

Gly Ala He Gly Tyr Arg Val Trp Arg Gin Asn Pro Asn Gly Gin Trp 
1125 1130 1135 
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Val Pro Phe Gly Pro Val Leu Pro Ala Thr Asp Leu Ser Val Thr Val 
1140 H45 1150 

Glu Arg Asp Ala Tyr Ala Gin Thr Phe Ala Val Gin Ala Leu Phe Ser 
1155 1160 1165 

Ala Ser Asp His Ala Gin Ser Pro Val Ser Ala Pro Lys Thr Val Ser 
1170 1175 1180 

Leu Pro Val Asp Val Pro Ala Val Arg Leu Ser Gin Pro He Val Ser 
1185 1190 1195 1200 

Gly Arg Val Val Gly Asp Arg Ala Met Val Ser He Thr Pro Val Ser 
1205 1210 1215 

Gly Ala Thr Gin Tyr Val He Tyr Gin Arg Gin Gly Asp Gly Ser Tyr 
1220 1225 1230 

Ala Pro Val Ala Thr Val Ser Thr Ser Gly Asp Ser Ala Ala He Gly 
1235 1240 1245 

Glu Val Pro Ala Gin Gly Pro Ala Asn Ser Pro His Ala Thr He Ara 
-1250 1255 1260 

Val Thr Val Pro Val Pro Ala Gly Phe Ser Ser Val Thr Tyr Aro Val 
1265 1270 1275 . 1280 

Ala Ala Gin Asn Glu Asp Gly Gin Ala Val Thr Asn Pro Leu Thr Leu 
1285 1290 1295 

Ser Leu Ser Lys Lys- 
1300 



(2) INFORMATION FOR SEQ ID N0s3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 47 bases 

(B) TyPEs Nucleic Acid 

(C) STRANDEDNESS: Double Stranded 

(D) TOPOLOGY: Linear 

(ii) MOLECDLAR TITPE: Nucleic Acid 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:3: 

TAGAAGATIG TIGGGTCIII IGG6AAI6AG AACTGGAAGT CIGAIAC 47 

(2) INFORMATION FOR SEQ ID NO:4: 

(1) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 32 bases 

(B> TYPE: Nucleic Acid 

(C) STRANDEDNESS: Double Stranded 

(D) TOPOLOGY: Linear 



wo 92/19744 



PCr/FI92/00138 



-48- 

(11) MOLECOLAR TYPE: Nucleic Acid 

(xl) SEQUENCE DESCRIPTION: SBQ ID NOs4s 

CA6TCCIGCT GI6TICC6AA ICCIGGGTC6 AT 32 



(2) INFORHATION FOR SEQ ID N0:5: 

(1) SEQUENCE CHARACTERISTICS; 

(A) LENGTH: 52 bases 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESSs Double Stranded 

(D) TOPOIiOGTs Linear 

(U) KOLECOLAR TYPE: Nucleic Acid 
(xl) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 
GCIAXGTTIC GGTTITA6AA IGGGTCIGGCA TIATCTCITA IATIACI606 T6 52 



(2) INFOR2SATION FOR SEQ ID NO: 6: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 Amino Acids 

(B) TYPE: Amino Acld 

(C) STRANDEDNESS: 

(D) TOPOLOGY: Linear 

(U) IK)LECULAR TYPE: 

(xl) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

TZR ASP THR ALA ASP TYR PBE LYS 
1 5 8 

(2) INFORMATION FOR SEQ ID NO:7: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 Amino Adds 

(B) TYPE: Amino Acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: Linear 

(11) MOLECULAR TYPE: 

(xl) SEQUENCE DESCRIPTION: SEQ ID NO: 7s 

LEU VAL SER ASP PHB GLN PHE SER PHB PRO Xaa ASP PRO THR ILE PHE 

1 5 10 15 

TYR 
17 
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(2) ZNFORM21TION FOR SEQ ID N0:8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 Amino Acids 

(B) TYPE: Amino Acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: Linear 

(ii) HOLECULAR TYPE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:8: 

ILE ASP PRO GLY PRE GLY THR GLN GLN ASP Xaa LEU ASN LEU VAL 6LN 
1 5 10 15 

ALA 
17 



(2) INFORKATION FOR SEQ ID NO: 9s 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 Amino Acids 

(B) TYPE: Amino ACid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: Linear 

(ii) MOLECULAR TYPE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:9: 

HIS ALA VAL ILE TYR GLU ILE MET PRO ASP PRO PHE TYR ASN GLY ASN 
15 10 15 

ILE ALA 
18 



(2) INFORKATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 Amino Acids 

(B) TYPE: Amino Acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: Linear 

(ii) MOLECULAR TYPE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

LEU ASP TYR LEU LYS 
1 5 
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(2) IHFORHATION FOR SEQ ID NOsll: 

(1) SEQUENCE CHARACTERISTICS; 

(A) LENGTH: 15 Amino Acids 

(B) TTPEs Amino Acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: Linear 

(ii) MOLECOLAR TYPE: 

(xi) SEQUENCE DESCRXPTIOK: SEQ ID NO: 11: 

SER LEU GLY VAL ASN THR LEU TYR LEU MET PRO VAL PHE GLU ALA 
15 10 15 



(2) INFOBMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 Amino Acids 

(B) TYPE: Amino Acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: Linear 

(ii) MOLECOLAR TYPE: 

(xi) SEQUENCE DESCRIPTIW: SEQ ID NO:12: 

PHE GLY ASN PHE HIS SER ASN GLY 
1 5 8 



(2) INFORMATION FOR SEQ ID NO:13: 

(i) SEQUENCE CHARAC3SRISTICS: 

(A) LENGTH: 6 Amino Acids 

(B) TYPE: Amino Acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: Linear 

(ii) HOLECOLAR TYPE: 

(xi) SEQUENCE DESC^PTION: SEQ ID NO: 13: 



GLY ILE TYR VAL GLY ALA 
1 5 6 
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WHAT IS CLAIMED IS ; 



!• Substantially pure acid a-amylase DNA. 

2. The substantially pure acid a-amylase DNA of claim 1, 
wherein the seqpience of said DNA is that of SEQ ID No. 1, or a 
biologically active fragment, variant, or functional derivative 
thereof. 

3. A recombinant DNA molecule, wherein said recombinant 
DNA molecule comprises the DNA of claim 2 • 

4- A cloning vector, wherein said cloning vector comprises 
the recombinamt DNA molecule of claim 3. 

5. The cloning vector of claim 4, wherein said recombinemt 
DNA molecule is capsJsle of being expressed. 

6. A host cell transformed with the recombinant DNA 
molecule of claim 3. 

7. The host cell of claim 6, wherein said host cell is a 
bacterial host. 

8. The host cell of claim 7, wherein said host cell is a 
Lactobacillus or a Bacillus. 

9. The host cell of claim 8, wherein said host cell is 
Lactobacillus . 
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10. The host cell of claim 8^ wherein said host: cell is 
Bacillus. 

11. A method for the production of acid oamylase, wherein 
said method comprises: 

(i) providing a nucleic acid molecule comprising the 
nucleic acid seq of SEQ ID No. 1^ or a fragment, variant or 
functional derivative thereof; 

(ii) transforming a host with said molecule; 

(iii) expressing said nucleic acid molecule in said 

host; and 

(iv) isolating the acid a-amylase produced by said 
expression. 

12. The method of claim 11, wherein said host is 
Lactobacillus • 

13. The method of claim 12, wherein said host is Bacillus, 

14. A substantially purified B. acidocaldarius acid a- 
amylase, wherein said acid a-amylase comprises the amino acid 
sequence of SEQ ID No. 2, or a biologically active fragment, 
variant or functional derivative therof . 
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ls. The substantially purified acid a-amylase of claim 14, 
wherein said acid a-aiaylase is produced by a method comprising: 

(i) providing a nucleic acid molecule comprising the 
nucleic acid seq of SEQ ID No. 1, or a biologically active 
fragment, variant or functional derivative thereof; 

(ii) transforming a host with waid molecule; 

(iii) expressing said nucleic acid molecule in said 

host; and 

(iv) isolating the acid • a-amylase produced by said 
expression. 

16. A method for preserving feed or fodder, wherein said 
method comprises the addition of the host of claim 6 to said feed 
or fodder. 

17. The method of claim 16, wherein said host is 
Lactobacillus • 

18. The method of claim 16, wherein said feed is cereal 
grain. 
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B . Acldocaldari us amylase: DNA sequence 

10 20 30 40 50 60 

ill!!! 
CATGTGGGTC6CCTCCTCT6C6ATG66ACATCTCACTTACATTTTAACA6TOT 



70 80 90 100 110 120 

666AAGGGJWVCAA6G6TCTT6A6A6AAAAGAAA6GATTCG6ACAAT6AACGATG^^ 



130 140 150 160 170 180 

CACCATT6AC6TCX;GGAAGGCT6A6C6TATAAT(X:A6GAC6ACAAC6TTA6TGCAATC6T 



190 200 210 220 230 240 

TTGCACAAGAGGA66TTGAT6CAC6T6AAGTGGACG6GT6CCGC6AGCCTTGCGACGACC 

MetHlsValLysTrpThrGlyAlaAlaSerLeuAlaThrThr 

250 260 270 280 290 300 

ATC6TCyiTGTCTGCAGCGCTTCC6TTaX:G6C6TTT6AAAACACCCGTTCCGTCGGCAC6 
IleValMetSerAlaAlaLeuProPheAlaAlaPheGluAsnThrAx^ 

310 320 330 340 350 360 

GAACGCGTTCTGGCGGCGACCGCGGCGGATAGTACTAAOSra^TCGCCC^ 
GluArgValLeiiAlaAlaThrAlaAlaAspSerThrAsnAlaSerProProThrGlyAla 

370 380 390 400 410 420 

TCXXsCIAGGCGGTGGCACAGCAGGAGAGACCTACT 

SerAla61y61y61yThrAlaGly61uThrT3n^SerAsnAlaVal61nIleValSer61y 

430 440 450 460 470 480 

AATGGCTTTCGOtCTGGAAACGGGGCCACGGTCGTCCTCGCTGTGAATGGACTGAACTTG 
AsnGlyPheArgAlaGlyAsnGlyAlaThrValValLeuAlaValAsnGlyLeuAsnLeu 

490 500 510 520 530 540 

AACCCCTC6AGTTTCAA6GTTGTGGTATCGAATAGCCTCAACG6C6TGATCGACGTCACA 
AsnProSerSerPheLysValValValSerAsnSerLeuAsnGlyVallleAspValThr 

550 560 570 580 590 600 

AGTGATTCC6TGCTAACG6GGCCGAATTC6ATTGCCTT6CACCTTCCT6CCGG6GATGAC 
SerAspSerValLeuThrGlyProAsnSerlleAlaLeuHlsLexiProAlaGlyAspAsp 

610 620 630 640 650 660 

GGGCTTGGCGCCGGTACGTACACGGTCACCGTCGAAAGTGGAGGTGATTCGGTCTCGACG 
GlyLeuGlyAlaGlyThrTyrThrValThrValGluSerGlyGlyAspSerValSezrThr 

670 680 690 700 710 720 

a:TTCAGGACAGGGACTTCAGGTCAT6CX:TTATAC6ACGGCC6ACACCATTCAGTGGGA^ 
ProSerGlyGlnGlyLeuGlnValMetProTyrThrThrAlaAspThrlleGlnTrpAsp 

730 740 750 760 770 780 

GGAATCTACA(»TCCGATGGCGC6ATGTACGTGTCAGATCCGAATCCCTCX:CCTGGA 
GlylleTyrThrSerAspGlyAlaMetTyrValSerAspProAsnProSerProGlyGln 

/ SUBSTITUTE SHEET/ 
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790 800 810 820 830 840 

GAGGTGACGATCAGCCTTCGAGOSTACAGCGGGAACXTGAaSA^ 
GluValThrlleSerlieuArgAlaTyrSearGlyAsnLeuIlirLysVa 

850 860 870 880 890 900 

TGGGACACCGC^CAAAACAAGGGTTTCCAGGTaSAGATGTCTCC^ 
TrpAspThrAlaGlnAsnLysGlyPheGlnValGluMetSerProGlyAx^h^ 

910 920 930 940 950 960 

CCGTATCyiGCrCTGGTCTGCCACGATCCCCGCTTaUUiaSGCGGAACGATCT^ 
ProTyrGlnLeuTrpSerAlaThrlleProAlaSerAsnGlyGlyThrlleTyx^ 

970 980 990 1000 1010 1020 

TTTGACATATACGATGGC»CCAGTTTTGCGTGTCTCTCyVGGTGACGGACTGCACACGTCT 
PheAspIleTyrAspGlyThrSerPheAlaC^sLeuSerGlyAspGlylieuHlsThrSe^ 

1030 1040 1050 1060 1070 1080 

GACGAO^TOUVCAACayiTTTCCCGTTGCCCGTGGGGACGGTCAC^ 
AspAspIleAsxiAsnAsnPheProLeuProValGlyThrValThrLeuSerThrl^u 

1090 1100 1110 1120 1130 1140 

6CGAATCCCGGT6ATACGGT6AC6GTCTCOGACCCTGTAGGTGACTTCGCCGGAA6CCAG 
AlaAsnProGlyAspThrValThrValSerAspProValGlyAspPheAlaGlySerGln 

1150 1160 1170 1180 1190 1200 

GATCAACCCAATCACACGGTGATACGGTTTGTCAACTCGTCGGGC^^ 
AspGlnProAsnHlsThrVallleArgPheValAsnSerSeitjlyGluThrAlaAlaThr 

1210 1220 1230 1240 1250 1260 

GTCAATGGGACGAACGOSAGCTGGAACAGaSTGCAGTTCACAGTCCXIACAGAGC^ 
ValAsnGlyThrAsnAlaSerTrpAsnSerValGlnPheThrValProGlnSerLei^ 

1270 1280 1290 1300 1310 1320 

AACGGCTTGTATCGCGTCGAGATCGACACGGTCGCCAAGGACGCGGATGGGGTGGT^^ 
AsnGlyLeuTyrArgValGluIleAspThrValAlaLysAspAlaAspGlyValValAsn 

1330 1340 1350 1360 1370 1380 

gtcgaattggacaggagtgcggagcttattgtagggcctctgcccxk:6tggatgcaagcg 
ValGluLeuAspArgSerAlaGluLeuIleValGlyProLeuProAlaTrpMetGlnAla 

1390 1400 1410 1420 1430 1440 

TATGCACy^TGATTCGTTTCAGGCGTTCTACCGATCGCCTTTCGGAGCCGTGTCCACAGGA 
TyrAlaHisAspSerPheGlnAlaPheTyrArgSerProPheGlyAlaValSerThrGly 

1450 1460 1470 1480 1490 1500 

ACCCX:aVTCACGCTTCGCCTGCGGGCTCCGCTCA6CGTGAAGA6TGCGACGCT 
ThrProIleThrLeiiArgLeuArgAlaPTOLeuSerValLysSerAlaThrLeui^ 

1510 1520 1530 1540 1550 1560 

TGGGGGGCAGCGGAT(»GTCAGGCGAGATCGACCTGCCGATGCA6AAGCTCCAAATGTCG 
TrpGlyAlaAlaAspGlnSerGlyGlulleAspLeuProMetGlnLysLeuGlnMetSer 
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1570 1580 1590 1600 1610 1620 

66AGACGA6TTGGC6CAACAAACC66C6TGC».66ACATCAACX]ACTACAC6T6^ 
GlyAspGlxjOiieuAlaGlnGlnThrGlyValGlnAspIleAsnAspTyrThr^ 

1630 1640 1650 1660 1670 1680 

GTGACCATCCCTGOSmSGATGTGACCACACXCGGGACGATGTG^ 
ValThrlleProAlaAlaAspValThrThrProGlyThrMetTrpTyrGliiPheVal^ 

1690 1700 1710 1720 1730 1740 

GAGACGGACACTGGCCAGGTGGTCTACTAC6ATGACAATGGAGCTCAGCTTGAAGGGCCT 
GluThrAspThrGlyGlnValValTyrTyrAspAspAsnGlyAlaGlnLeuGluGlyl^ 

1750 1760 1770 1780 1790 1800 

GGCCAGGTTGGGTTGTCTTCCGACGGACCGAGCTACCAGATCAGCGTATACGAACGGGGA 
GlyGlnValGlyLeuSerSerAspGlyProSerTyrGlnlleSerValTyrGluArgGly 

1810 1820 1830 1840 1850 1860 

TTTCAGAC6CCAGATTGGCT6AAGCACGCCGTGATCTAC6AAATCATGCCGGATCGGTTC 
PheGlnThrProAspTrpLeuLysHlsAlaVallleTyrGluIleMetParoAspArgPhe 

1870 1880 1890 1900 1910 1920 

TACy^TGGCAATATCGO^CGGAGGAGAATCCGAATACGa^^^ 
TyrAsnGlyAsnlleAlaThxGluGliiAsnProAsnThrGlnLysGlylleTyrValGly 

1930 1940 1950 1960 1970 1980 

GCCGATGGAACGGAGTCATTAGGCCCCATCCAGTTCXA^ 

AlaAspGlyThrGliiSerLeuGlyProIleGlnPheHlsGluAsnTrpAspSerProPro 

1990 2000 2010 2020 2030 2040 

TATGATCOSAATATTCCTCCGTTATCTGATCCCAAAATTGCCAGTCTGCXaGGC;^ 
TyrAspProAsnlleProProLeuSerAspProLysIleAlaSerLeiiArgGlyAsnGly 

2050 2060 2070 2080 2090 2100 

CaATGGAAO^TTGACTTTTTOMAGGTGATTTGCAGGGCATCGA^^^ 
GlnTrpAsnlleAspPhePheGlyGlyAspLeuGlnGlyXleGluAspLysLeuAspTyr 

2110 2120 2130 2140 2150 2160 

CTGAA6AGCCTTG6AGTCAATACGCTGTATCTGATGCCCGTCTTTGAGGCG6AATCCAAT 
LeuLysSerLeuGlyValAsnThrLeuTyrLeuMetProValPheGluAlaGluSerAsn 

2170 2180 2190 2200 2210 2220 

CACAAATATGACACAGCCGACTATTTCAAGATTGACCCTGGATTTGGAACGCAGCAGGAC 
HisLysTyrAspThrAlaAspTyrPheLysIleAspProGlyPheGlyThrGlnGlnAsp 

2230 2240 2250 2260 2270 2280 

TGGCT6AATCTCGTACAGGCTGCGCACGCGAAGGGGTTCCATATCATTCTCGACGGGGTG 
TrpLeuAsnLeuValGlnAIaAlaHlsAlaLysGlyPheMlsIlelleLeijAspGlyVal 

2290 2300 2310 2320 2330 2340 

TTCGAAGATACOKSAAGTGACyVGCGTATATTTCAACaVAGTTCGGGAACTTCCACTCCAAC 
PheGluAspThrGlySerAspSerValTyrPheAsnLysPheGlyAsnPheHlsSerAsn 
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2350 2360 2370 2380 2390 2400 

66TGC6T66CA66C6TACCn!GAAGAACCA6(XGTC6CTGTCGCX:CTACTACTC6T^^ 
GlyAlaTxpGlnAlaTyrLeuLysAsnGlnProSerlieuSerProTyrTyrSerTx^ 

2410 2420 2430 2440 2450 2460 

6TGTGGACAGGGAACACCTCAAACCC»TACGATTCGTGGTTTCAGATOTACA(^ 
ValTrpThrGlyAsnThrSerAsnProTyrAspSerTrpPheGlnlleAspThrLeuPro 

2470 2480 2490 2500 2510 2520 

CTTAO^GACACGTCGAACXXXX^CTATCAGCGATTCGTGTATGGGAGCGACAA^ 
LeuThrAspThrSerAsnProAlaTyrGlnArgPheValTyrGlySerAspAs^ 

2530 2540 2550 2560 2570 2580 

GCGCGTGTGTGGATCCGGGAAGGTGCGGACGGATGGCGCTTGGACraXXX^ 
AlaArgValTrpIleArgGluGlyAlaAspGlyTrpArgLeuAspSerAlaAspAsnGly 

2590 2600 2610 2620 2630 2640 

AATTTCAACACGGCATGGTGGGGTGGCTTTCGGCAGGC^ 

AsnPheAsnThrAlaTrpTrpGlyGlyPheAi^lnAlaValLysSerlleAspProAsni 

2650 2660 2670 2680 2690 2700 

GCAGCGATCATCGGCGAGATCTGGGACAATGCGACGAATGACAATGGAAC^ 
AlaAlallelleGlyGluIleTrpAspAsnAlaThrAsnAspAsnGlyThrAspTrpLeu 

2710 2720 2730 2740 2750 2760 

ACGGGATC6ACCTTCGACAGTGTAATGAACTACCA6TTCCGGAACGCCGTGATCGACTTC 
ThrGlySexThrPheAspSerValMetJ^nTyrGlxiPheArgAsnAlaValllei^ 

2770 2780 2790 2800 2810 2820 

TTOXK^GGCACGTACAACGACGGAAACGTGCAGCACCACGCCGTCGACGCTGCGGGAT^ 
PheArgGlyThrTjrrAsxiAspGlyAsnValGlnHisHlsAlaValAspAlaAlaGlyPhe 

2830 2840 2850 2860 2870 2880 

AACCAGGAACTGATGCGCCTGTACAGTGAATATCCTCTGCAGTCGTTCTACTCGATGATG 
AsnGlnGluLeuMetArgLeuTyrSerGluTyrProLeuGlnSerPheTyrSerMetMet 

2890 2900 2910 2920 2930 2940 

AACCTTGTCGATTCGCAAGACACCATGCGGATCCTGACCATCTTGGAGAACGCGCCGCAG 
AsnLeuValAspSerGlxiAspThrMetArglleLeuThrlleLeuGltiAsnAlaP^ 

2950 2960 2970 2980 2990 3000 

CCy^GGCGATCTATCCGCGCTCCAGCAGGATGAGTACAGGCCGTCTCCTGCGGCTGAACAG 
FroGlyAspLeuSerAlaLeuGlnGlnAspGluTyrArgProSerProAlaAlaGluGln 

3010 3020 3030 3040 3050 3060 

TTGGGGATCGAGAGGCTGAAGCTT6TATO;GACTTTC^TTCAGCTTCa:GGGCGATC^^ 
LeuGlylleGluArgLeuLysLeuValSerAspPheGlnPheSerPheProGlyAspPro 

3070 3080 3090 3100 3110 3120 

accatcttctacggcgacgaggc»gggctcactggttattcggatcxxx:tcaatc6tcgg 

ThrllePheTyrGlyAspGluAlaGlyLeuThrGlyTyrSerAspProLeuAsnArgArg 
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3130 3140 3150 3160 3170 3180 

ACCTATCC6T6G6ACAACCA6AATCTC6ATCTa:TGAACCACTACCGCM 
ThrTyrProTrpAspAsnGlnAsnLeuAspI^uLeijAsnHlsTyr^^ 

3190 3200 3210 3220 3230 3240 

ATTCGAAACGCCAATCCTGTGCTTCAGACGGGGGATTTCACGCCGTTGTACGC^ 
IleArgAsnAlaAsnProValLeuGlnThrGlyAspPheThrProLeuTyrAlaGlnGly 

3250 3260 3270 3280 3290 3300 

ATGGTGTACGCyiTTTGCAAGGACCATTCGGAATGGGCGAGATGTCTTCGGTGTGCCAGC^ 
MetValTyrAlaPheAlaAxgThrlleArgAsnGlyArgAspValPheGlyValPxo^ 

3310 3320 3330 3340 3350 3360 

GAGGATGCavCGGCCATTGTGGCGATCAACAATCAGAACCAAGCTATOVCCGa^ 
GliiAspAlaThrAlalleValAlalleAsnAsnGlnAsnGlnAlalleThrValThrlle 

3370 3380 3390 3400 3410 3420 

CCGAC6GATGGGAa»5TTGCGGAOGGGTOCAOGATGCTCX?ATGAACTGAACAAC^ 
ProThrAspGlyThrValAlaAspGlySerThxMetLeuAspGluLeuAsnAsziGlnT^ 

3430 3440 3450 3460 3470 3480 

TACAAGGTGCAGAATGGTGGCATTACACTCACGCTGCAATCGTATCAAGGTGCCATTTTG 
TyrLysValGlnAsnGlyGlylleThrLeuThrLeuGlJiSerTyrGlnGlyAlalleLeu 

3490 3500 3510 3520 3530 3540 

6TGACGCC6AGC6ACGGGCCGATGGCTTATCTGCAA6AG6AGGATTCTCA(^ 
ValThrProSerAspAlaProMetAlaTyrLeuGlnGluGluAspSerGlnAsn^ 

3550 3560 3570 3580 3590 3600 

GCGT66ACGCCTGTGCAAGGTGCCATC6GTTATCGC6TCTGGAGACAGAATCCGAA 
AlaTrpThrProValGlnGlyAlalleGlyTyrArgValTrpArgGlnAsnPrx^ 

3610 3620 3630 3640 3650 3660 

CAATGGGTGCCCTTTGGACCTGT6CTTCCTGCCACGGACTTGAGTGTCACGGTGGAACGC 
GlnTrpValProPheGlyProValLeuProAlaThrAspLeuSerValThrValGluArg 

3670 3680 3690 3700 3710 3720 

GATGCATATGCGCAAACGTTTGCTGTACAAGCGCTGTTTTCGGCGTCTGATCACGCCCAG 
AspAlaTyrAlaGlnThrPheAlaValGlnAlaLeuPheSerAlaSerAspHlsAlaGln 

3730 3740 3750 3760 3770 3780 

TCTCCGGTGTCGGCACCTAAGACGGTATCGCTTCCCGTCGATGTGCCCGCGGTACGCCTG 
SerProValSerAlaProLysThrValSerLeuProValAspValProAlaValArgLeu 

3790 3800 3810 3820 3830 3840 

AGTCA6C3C6ATCGTTAGTGGTCGTGTG6TT6GAGATCGTGCGATGGTCTCGATCACGCCG 
SerGliiProIleValSerGlyArgValValGlyAspArgAlaMetValSerlleThrPro 

3850 3860 3870 3880 3890 3900 

GTTTCAGGCGCGACGCAGTATGTGATCTAOAGAGACy^GGGCGAa^SATCGTATGCTCC^ 
ValSerGlyAlaThrtSlnl^rVallleTyrGlnArgGlnGlyAspGlySeril^ 
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3910 3920 3930 3940 3950 3960 

6TCGC6AC6GTCT(X».CAA6TGGCGATTCC6CA6CTATAGGGGAA6TTCCTGC(^ 
ValAlaThrValSerThrSer61yAspSerAlaAlaIleGlyGluValPxx)Al^^^ 

3970 3980 3990 4000 4010 4020 

CCGGCClihCTCGCCTCACGCGh^ 

ProAlaAsnSerProHlsAlaThrlleArgValThrValProValProAlaGlyPheSer 

4030 4040 4050 4060 4070 4080 

TC66T6Aa:TAaXKX7TGGCrrGCGCAAAAC6AAGATGGGX:^ 
SerValThxiTyrArgValAlaAlaGlnAsnfiGluAspGlyGlnAlaValThrAsn^^ 

4090 4100 4110 4120 4130 4140 

ACCCTATCGCTCTCGAAAAAGTGATGCCTCGCGAAAAGGGCTATCGGAATTTT^ 
ThrLeuSerLeuSerLysLys 
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